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ISOLATION, CLONING AND CHARACTERIZATION OF NEW ADENO- 
ASSOCIATED VIRUS (AAV) SEROTYPES 

CROSS-REFERENCE TO RELATED APPLICATIONS 

This application claims benefit of U.S. Provisional Application No. 60/676,604, filed 
April 29, 2005, which is hereby incorporated herein by reference in its entirety. 

BACKGROUND OF THE INVENTION 

Adeno-associated virus (AAV) is a member of the Parvoviridae, a virus family 
characterized by a single stranded linear DNA genome and a small icosahedral shaped capsid 
measuring about 20nm in diameter. AAV was first described as a contamination of tissue 
culture grown simian virus 15, a simian adenovirus and was found dependent on adenovirus 
for measurable replication. This lead to its name, adeno-associated virus, and its classification 
in the genus Dependovirus (reviewed in Hoggan et al., 1970). AAV is a common contaminant 
of adenovirus samples and has been isolated from human virus samples (AAV2, AAV3, 
AAV5), from samples of simian virus- 15 infected cells (AAV1, AAV4) as well as from stocks 
of avian (AAAV) (Bossis and Chiorini, 2003), bovine, canine and ovine adenovirus and 
laboratory adenovirus type 5 stock (AAV6). DNA spanning the entire rep-cap ORFs of AAV7 
and AAV8 was amplified by PCR from heart tissue of rhesus monkeys (Gao et al., 2002). With 
the exception of AAVs 1 and 6, all cloned AAV isolates appear to be serologically distinct. 
Nine isolates have been cloned, and recombinant viral stocks have been generated from each 
isolated virus. 

AAV appears to commonly infect humans. 50%-80% of adults in North America are 
seropositive for AAV. A steep rise in antibody response against AAV 1 -3 was observed in the 
age group between 1-10 years (Blacklow et al., 1968). AAV 2 and 3 were readily isolated from 
anal and throat specimens from children (Blacklow et al, 1967) whereas isolation from adults 
was not observed. It appears that AAV spreads primarily in the young population (Hoggan, 
1970). Prevalence of antibodies against AAV was found to be similar in Europe, Brazil and 
Japan, which suggests a global spread of AAV (Erles et al, 1999). Infection with AAV appears 
to be benign in man and laboratory animals. Currently, no disease has been associated with 
AAV infections. 

AAV2 is the best characterized adeno-associated virus and will be discussed as an 
AAV prototype. The AAV2 genome consists of a linear single stranded DNA of 4,780 . 
nucleotides. Both polarities of DNA are encapsulated by AAV with equal efficiency. The 
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AAV2 genome contains 2 open reading frames (ORF) named rep and cap. The rep ORF 
encodes the non-structural proteins that are essential for viral DNA replication, packaging and 
AAV integration. The cap ORF encodes the capsid proteins. The rep ORF is transcribed from 
promoters at map units P5 and PI 9. The rep transcripts contain an intron close to the 3' end of 
the rep ORF and can be alternatively spliced. The rep ORF is therefore expressed as 4 partially 
overlapping proteins, which were termed according to their molecular weight Rep78, 68, 52 
and 40. The cap ORF is expressed from a single promoter at P40. By alternative splicing and 
utilization of an alternative ACG start codon, cap is expressed into the capsid proteins VP 1-3 
which range in size from 65-86 kDa. VP3 is the most abundant capsid protein and constitutes 
80% of the AAV2 capsid. All viral transcripts terminate at a polyA signal at map unit 96. 

During a productive AA V2 infection, unspliced mRNAs from the p5 promoter 
encoding Rep78 are the first detectable viral transcripts. In the course of infection, expression 
from P5, PI 9 and P40 increase to 1:3:18 levels respectively. The levels of spliced transcripts 
increased to 50% for P5, PI 9 products and 90% of P40 expressed RNA (Mouw and Pintel, 
2000). 

The AAV2 genome is terminated on both sides by inverted terminal repeats (ITRs) of 
145 nucleotides (nt). 125 nt of the ITR constitute a palindrome which contains 2 internal 
palindromes of 21 nt each. The ITR can fold back on itself to generate a T-shaped hairpin with 
only 7 non-paired bases. The stem of the ITR contains a Rep binding site (RBS) and a 
sequence that is site and strand specifically cleaved by Rep - the terminal resolution site 
(TRS). The ITR is essential for AAV2 genome replication, integration and contains the 
packaging signals. 

The single-stranded AAV2 genome is packaged into a non-enveloped icosahedral 
shaped capsid of about 20-25 nm diameter. The virion consists of 26% DNA and 74% protein 
and has a density of 1 .41 g/cm\ AAV2 particles are extremely stable and can withstand 
heating to 60°C for 1 hour, extreme ph, and extraction with organic solvents. 

Rep proteins are involved in almost every step of AAV2 replication including AAV2 
genome replication, integration, and packaging. Rep78 and Rep68 possess ATPase, 3 '-5 5 
helicase, ligase and nicking activities and bind specifically to DNA. Rep52 and Rep40 appear 
to be involved in the encapsidation process and encode ATPase and 3'-5' helicase activities. 
Mutational analysis suggests a domain structure for Rep78. The N-terminal 225 aa are 
involved in DNA binding, DNA nicking and ligation. Rep78 and Rep68 recognize a GCTC 
repeat motif in the ITR as well as in a linear truncated form of the ITR (Chiorini et al., 1994) 



2 



WO 2006/119432 PCT/US2006/017157 

with similar efficiencies. Rep78 and Rep68 possess a sequence and strand specific 
endonuclease activity, which cleaves the ITR at the terminal resolution site (TRS). Rep 
endonuclease activity is dependent on nucleoside triphosphate hydrolysis and presence of 
metal cations. Rep 78 and 68 can also bind and cleave single stranded DNA in a NTP 
independent matter. In addition, Rep78 catalyzes rejoining of single stranded DNA substrates 
originating from the AAV2 origin of replication - i.e., sequences containing a rep binding and 
terminal resolution element. 

The central region of AAV2 Rep78, which represents the N-terminus of Rep52 and 
Rep40, contains the ATPase and 3'-5' helicase activities as well as nuclear localization signals. 
The helicase activity unwinds DNA-DNA and DNA-RNA duplexes, but not RJSfA-RNA. The 
ATPase activity is constitutive and independent of a DNA substrate. The C-terminus of Rep78 
contains a potential zinc-finger domain and can inhibit the cellular serine/threonine kinase 
activity of PKA as well as its homolog PRKX by pseudosubstrate inhibition. Rep68 which is 
translated from a spliced mRNA that encodes the N-terminal 529 amino acids (aa) of Rep78 
fused to 7 aa unique for Rep68, doesn't inhibit either PKA or PRKX. In addition to these 
biochemical activities, Rep can affect intracellular conditions by protein-protein interactions. 
Rep78 binds to a variety of cellular proteins including transcription factors like SP-1, high- 
mobility-group non-histone protein 1 (HMG-1) and the oncosuppressor p53. Overexpression of 
Rep results in pleiotrophic effects. Rep78 disrupts cell cycle progression and inhibits 
transformation by cellular and viral oncogenes. In susceptible cell lines, overexpression of Rep 
resulted in apoptosis and cell death. Several of Rep78 activities contribute to cytotoxicity, 
including its constitutive ATPase activity, interference with cellular gene expression and 
protein interactions. 

The first step of an AAV infection is binding to the cell surface. Receptors and 
coreceptors for AAV2 include heparan sulfate proteoglycan, fibroblast growth factor receptor- 
1, and a v ps integrins whereas N-linked 2,3-linked sialic acid is required for AAV5 binding and 
transduction (Walters et al., 2001). In HeLa cells, fluorescently labeled AAV2 particles appear 
to enter the cell via receptor-mediated endocytosis in clathrin coated pits. More than 60% of 
bound virus was internalized within 10 min after infection. Labeled AAV particles are 
observed to have escaped from the endosome, been trafficked via the cytoplasm to the cell 
nucleus and accumulated perinuclear, before entering the nucleus, probably via nuclear pore 
complex (NPC). AAV2 particles have been detected in the nucleus, suggesting that uncoating 
takes place in the nucleus (Bartlett et al., 2000; Sanlioglu et al., 2000). AAV5 is internalized in 
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HeLa cells predominantly by clathrin coated vesicles, but to a lesser degree also in noncoated 
pits. AAV particles can also be trafficked intercellularly via the Golgi apparatus (Bantel-Schaal 
et al., 2002). At least partial uncoating of AAV5 was suggested to take place before entering 
the nucleus since intact AAV5 particles could not be detected in the nucleus (Bantel-Schaal et 
al., 2002) After uncoating, the single stranded genome is converted into duplex DNA either by 
leading strand synthesis or annealing of input DNA of opposite polarity. AAV replication takes 
place within the nucleus. 

During a co-infection with a helper virus such as Adenovirus, herpes simplex virus or 
cytomegalovirus, AAV is capable of an efficient productive replication. The helper functions 
provided by Adenovirus have been studied in great detail. In human embryonic kidney 293 
cells, which constitutively express the Adenovirus El A and E1B genes, the early Adenovirus 
gene products of E2A, E4 and VA were found sufficient to allow replication of recombinant 
AAV. Allen et al. reported that efficient production of rAAV is possible in 293 cells 
transfected with only an E4orf6 expression plasmid (Allen et al., 2000). El A stimulates S 
phase entry and induces unscheduled DNA synthesis by inactivating the pRB checkpoint at the 
Gl/S border by interaction with pRB family proteins which results in the release of E2F 
(reviewed in (Ben-Israel and Kleinberger, 2002). This leads to either induction or activation of 
enzymes involved in nucleotide synthesis and DNA replication. Since unscheduled DNA 
synthesis is a strong apoptotic signal, anti-apoptotic functions are required. ElB-19k is a Bcl-2 
homolog and ElB-55k is a p53 antagonist. Both proteins have anti-apoptotic functions. E4orf6 
forms a complex with ElB-55k and results in degradation of p53. It is also reported to cause S- 
phase arrest (Ben-Israel and Kleinberger, 2002). E2A encodes a single strand DNA binding 
protein, which appears to be non-essential for DNA replication but effects gene expression 
(Chang and Shenk, 1990) (Fields 39, 40). The VA transcription unit affects AAV2 RNA 
stability and translation (Janik et al, 1989). El A has a more direct effect on AAV2 gene 
expression. The cellular transcription factor YY-1 binds and inhibits the viral P5 promoter. 
El A relieves this transcriptional block. None of the late Ad gene products have been found to 
be essential for AAV2 replication. The main function of the helper virus appears to be the 
generation of a cellular environment with active DNA replication machinery and blocked pro- 
apoptotic functions that allows high-level AAV replication rather than a direct involvement in 
AAV replication. 

While AAV is usually dependent on a helper virus for efficient replication, low level 
AAV replication was observed under conditions of genotoxic stress (Y akinoglu et al., 1988; 

4 



WO 2006/119432 PCTAJS2006/017157 

Yakobson et aL, 1 989). AAV DNA replication and particle formation was also observed in 
differentiating keratinocytes in the absence of helper virus infection (Meyers et aL, 2000). This 
demonstrates that AAV is not defective per se but rather depends on the helper virus to 
establish the favorable cellular condition and to provide factors for efficient replication 

The ability of AAV vectors to infect dividing and non-dividing cells, establish long- 
term transgene expression, and the lack of pathogenicity has made them attractive for use in 
gene therapy applications. Lack of cross competition in binding experiments suggests that each 
AAV serotype may have a distinct mechanism of cell entry. Comparison of the cap ORFs 
from different serotypes has identified blocks of conserved and divergent sequence, with most 
of the latter residing on the exterior of the virion, thus explaining the altered tissue tropism 
among serotypes (19-21, 48, 56). Vectors based on new AAV serotypes may have different 
host range and different immunological properties, thus allowing for more efficient 
transduction in certain cell types. In addition, characterization of new serotypes will aid in 
identifying viral elements required for altered tissue tropism. 

BRIEF SUMMARY OF THE INVENTION 

Provided herein are adeno-associated viruses (AAV) and vectors derived therefrom. 
Thus, the present invention relates to AAV vectors for and methods of delivering nucleic acids 
to cells of subjects. 

Additional advantages of the disclosed method and compositions will be set forth in 
part in the description which follows, and in part will be understood from the description, or 
may be learned by practice of the disclosed method and compositions. The advantages of the 
disclosed method and compositions will be realized and attained by means of the elements and 
combinations particularly pointed out in the appended claims. It is to be understood that both 
the foregoing general description and the following detailed description are exemplary and 
explanatory only and are not restrictive of the invention as claimed. 

BRIEF DESCRIPTION OF THE DRAWINGS 

The accompanying drawings, which are incorporated in and constitute a part of this 
specification, illustrate several embodiments of the disclosed method and compositions and 
together with the description, serve to explain the principles of the disclosed method and 
compositions. 

Figure 1 shows evolutionary relationship among human, non-human primate AAVs, 
and AAV contaminants detected in adenovirus stocks. The phylogenetic tree is based on 
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merged ClustalW alignments of VP1 sequences. VR numbers are identifiers of the ATCC virus 
collection. 

Figure 2 shows rAAV6, rAAV(VR-195), and rAAV(VR-355) are neutralized by pooled 
human IgGs. COS cells were transduced with a pre-incubation mixture consisting of rAAV-6, 
rAAV(VR-195), and rAAV(VR-355) expressing GFP and human IgGs at the indicated 
concentrations. Twenty-four hours post-inoculation, transduction was analyzed by flow 
cytometry and graphed as percent transduction of the untreated control. Values are means from 
three experiments; error bars represent standard deviations. 

Figure 3 shows inhibitory effect of heparin on of COS cell transduction. COS cells 
were transduced with a pre-incubation mixture consisting out of rAAV2, rAAV5, rAAV6, 
rAAV(VR-195), and rAAV(VR-355) expressing GFP and heparin at the indicated 
concentrations. 24 h post-inoculation, transduction was analyzed by flow cytometry. Values 
• are means from three experiments; error bars represent standard deviations. 

Figure 4 shows charge-dependency of AAV6, rAAV(VR-195), and rAAV(VR-355) 
transduction of COS cells. COS cells were transduced with rAAV6, rAAV(VR-195), and 
rAAV(VR-355) in medium containing the indicated concentrations of NaCl. Twenty- four hour 
postinoculation, transduction was analyzed by flow cytometry. Values are means from three 
experiments; error bars represent standard deviations 

Figure 5 shows neuraminidase treatment blocks rAAV6, rAAV(VR-195), and 
rAAV(VR-355) transduction and cell binding. Figure 5A shows gene transfer mediated by 
rAAV6, rAAV(VR-195), and rAAV(VR-355) expressing GFP in COS cells, following 
neuraminidase pretreatment. Figure 5B shows binding. Values are means from three 
experiments; error bars represent standard deviations 

Figure 6 shows effects of lectins on rAAV6, rAAV(VR-195), and rAAV(VR-355) 
transduction of COS cells. After pre-incubation with ECL, LCA, Malll or STL, COS cells 
were transduced with rAAV-6, rAAV(VR-195), and rAAV(VR-355) expressing GFP in the 
presence of the indicated lectin. Twenty-four hour post-inoculation, transduction was analyzed 
by flow cytometry. Values are means from two experiments done in duplicate; error bars 
represent standard deviations. 

Figure 7 shows transduction efficiency in human cancer cell lines. The indicated cell 
lines were transduced with 1x10 s particles of rAAV6, rAAV(VR-195), and rAAV(VR-355) 
expressing a nuclear localized GFP. Twenty-four hour post-inoculation, transduction was 
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analyzed by flow cytometry. Values are means from three experiments; error bars represent 
standard deviations. * COS cells transduction is given as transducing units/ 2xl0 7 particles. 

Figure 8 shows AAV6 competition. COS cells were transduced with a constant amount 
of rAAV6, rAAV(VR-195) or rAAV(VR-355) expressing GFP after 60 min pre-incubation 
with increasing titers of rAAV6-iacZ. Forty-eight hour post-inoculation, transduction was 
analyzed by flow cytometry. Values are means from three experiments; error bars represent 
standard deviations. 

Figure 9 shows effects of sugars on rAAV mediated transduction. Cos cells were 
transduced with GFP encoding rAAVs that were pre-incubated with various sugar monomers 
and polymers. 24 h post-inoculation, transduction was analyzed by flow cytometry. Heparin is 
the attachment factor for AAV2 that mediates initial binding of AAV2 to the cell. Extracellular 
heparin binds to the virus and blocks the attachment of the virus to the cell. None of the sugars 
tested blocked transduction of AAV-X1, AAV-X5, AAV-X25 or AAV-X26. 

Figure 10 shows effects of lectins on rAAV transduction of COS cells. After pre- 
incubation with various lectins, COS cells were transduced with rAAV-6, AAV-X1 and AAV- 
X25 expressing GFP in the presence of the indicated lectin. 24 h post-inoculation, transduction 
was analyzed by flow cytometry. All AAVs have a distinct profile on the lectin panel. 

Figure 11 shows transduction efficiency of rAAVs in cell lines. Cells were transduced 
with various rAAV serotypes encoding GFP. Two days after inoculation, cells were analyzed 
for GFP expression by flow cytometry. The transduction profiles of AAV-X1, AAV-X5, 
AAV-X25 and AAV-X26 were different from that of known AAVs. 

Figure 12 shows rAAV 12 COS cell transduction is not inhibited by heparin. COS cells 
were transduced with a pre-incubation mixture consisting out of rAAV2-GFP, or rAAV12- 
GFP expressing GFP and heparin at the indicated concentrations. 24 h post-inoculation,, 
transduction was analyzed by flow cytometry. Values are means from three experiments; error 
bars represent standard deviations. 

Figure 13 shows rAAV 12 transduction is independent of cell surface sialic acid. COS 
cells were pretreated with the V. cholera neuraminidase to remove exposed sialic acids groups 
before the cells were transduced with rAAV2, rAAV4, rAAV5 and rAAV12-GFP. Gene 
transfer was determined by flow cytometry. Values are means from three experiments; error 
bars represent standard deviations. 

Figure 14 shows rAAV 12 transduction is protease sensitive and does not require 
glycosphingolipids. COS cells were proteolytically digested with trypsin (A) or treated with 
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" the glycosphingolipids synthesis inhibitors PPMP (B) prior to transduction with rAAV2-GFP, 
rAAV12-GFP and rBAAV-GFP. Gene transfer in these cells was compared to untreated 
cultures. Values are means from three experiments; error bars represent standard deviations. 

Figure 15 shows extracellular mannosamine inhibits rAAV12 transduction. COS cells 
were transduced with a pre-incubation mixture of rAAV2-GFP, or rAAV12-GFP and 
mannosamine at the indicated concentrations. 24 h post-inoculation, transduction was analyzed 
by flow cytometry. Values are means from three experiments; error bars represent standard 
deviations. 

Figure 16 shows rAAV12 has a broad tropism. Transduction efficiency of rAAV12 was 
compared to rAAV4 in 1 3 human cancer cell lines. Cells were transduced with particles of 
either rAAV12-GFP or rAAV4-GFP. Transduction was analyzed by flow cytometry 28 h after 
virus inoculation. Values are means from three experiments; error bars represent standard 
deviations. 

Figure 17 shows rAAV12 is highly resistant to neutralization by human IgGs. rAAV2- 
GFP, rAAV6-GFP and rAAV12-GFP were incubated with pooled human IgGs prior to 
transduction of COS cells. 24 h post-inoculation, transduction was analyzed by flow 
cytometry. Transduction efficiencies relative to an untreated control were plotted. Values are 
means from three experiments; error bars represent standard deviations. 

Figure 1 8 shows rAAV12 transduces salivary glands and skeletal muscles in vivo. Male 
Balb/c mice were administered 10 9 particles of either AAV2-hEPO or AAV12-hEPO by 
retrograde ductal delivery to both submandibular glands or in both their tibialis anterior (two 
injection sites per muscle). Secretion of hEPO in mouse serum was determined 4 weeks after 
transduction by an ELISA test. Error bars represent standard deviations. 

DETAILED DESCRIPTION OF THE INVENTION 

The disclosed method and compositions may be understood more readily by reference 
to the following detailed description of particular embodiments and the Example included 
therein and to the Figures and their previous and following description. 

It is to be understood that the disclosed methods and compositions are not limited to 
specific synthetic methods, specific analytical techniques, or to particular reagents unless 
otherwise specified, and, as such, may vary. It is also to be understood that the terminology 
used herein is for the purpose of describing particular embodiments only and is not intended to 
be limiting. 
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Disclosed are materials, compositions, and components that can be used for, can be 
used in conjunction with, can be used in preparation for, or are products of the disclosed 
method and compositions. These and other materials are disclosed herein, and it is understood 
that when combinations, subsets, interactions, groups, etc. of these materials are disclosed that 
while specific reference of each various individual and collective combinations and 
permutation of these compounds may not be explicitly disclosed, each is specifically 
contemplated and described herein. For example, if a vector is disclosed and discussed and a 
number of modifications that can be made to a number of molecules including the promoters 
and ITRs are discussed, then each and every combination and permutation of the promoters 
and ITRs, and the modifications that are possible, are specifically contemplated unless 
specifically indicated to the contrary. Thus, if a class of molecules A, B, and C are disclosed 
as well as a class of molecules D, E, and F and an example of a combination molecule, A-D is 
disclosed, then even if each is not individually recited, each is individually and collectively 
contemplated. Thus, is this example, each of the combinations A-E, A-F, B-D, B-E, B-F, C-D, 
C-E, and C-F are specifically contemplated and should be considered disclosed from disclosure 
of A, B, and C; D, E, and F; and the example combination A-D. Likewise, any subset or 
combination of these is also specifically contemplated and disclosed. Thus, for example, the 
sub-group of A-E, B-F, and C-E are specifically contemplated and should be considered 
disclosed from disclosure of A, B, and C; D, E, and F; and the example combination A-D. 
This concept applies to all aspects of this application including, but not limited to, steps in 
methods of making and using the disclosed compositions. Thus, if there are a variety of 
additional steps that can be performed it is understood that each of these additional steps can be 
performed , with any specific embodiment or combination of embodiments of the disclosed 
methods, and that each such combination is specifically contemplated and should be 
considered disclosed. 

It must be noted that as used herein and in the appended claims, the singular forms "a " 
"an", and "the" include plural reference unless the context clearly dictates otherwise. Thus, for 
example, reference to "a vector" includes a plurality of such vectors, reference to "the vector" 
is a reference to one or more vectors and equivalents thereof known to those skilled in the art, 
and so forth. 

"Optional" or "optionally" means that the subsequently described event, circumstance, 
or material may or may not occur or be present, and that the description includes instances 
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where the event, circumstance, or material occurs or is present and instances where it does not 
occur or is not present. 

Ranges may be expressed herein as from "about" one particular value, and/or to 
"about" another particular value. When such a range is expressed, also specifically 
contemplated and considered disclosed is the range from the one particular value and/or to the 
other particular value unless the context specifically indicates otherwise. Similarly, when 
values are expressed as approximations, by use of the antecedent "about," it will be understood 
that the particular value forms another, specifically contemplated embodiment that should be 
considered disclosed unless the context specifically indicates otherwise. It will be further 
understood that the endpoints of each of the ranges are significant both in relation to the other 
endpoint, and independently of the other endpoint unless the context specifically indicates 
otherwise- Finally, it should be understood that all of the individual values and sub-ranges of 
values contained within an explicitly disclosed range are also specifically contemplated and 
should be considered disclosed unless the context specifically indicates otherwise. The 
foregoing applies regardless of whether in particular cases some or all of these embodiments 
are explicitly disclosed. 

Unless defined otherwise, all technical and scientific terms used herein have the same 
meanings as commonly understood by one of skill in the art to which the disclosed method and 
compositions belong. Although any methods and materials similar or equivalent to those 
described herein can be used in the practice or testing of the present method and compositions, 
the particularly useful methods, devices, and materials are as described. Publications cited 
herein and the material for which they are cited are hereby specifically incorporated by 
reference. Nothing herein is to be construed, as an admission that the present invention is not 
entitled to antedate such disclosure by virtue of prior invention. No admission is made that any 
reference constitutes prior art. The discussion of references states what their authors assert, 
and applicants reserve the right to challenge the accuracy and pertinency of the cited 
documents. It will be clearly understood that, although a number of publications are referred 
to herein, such reference does not constitute an admission that any of these documents forms 
part of the common general knowledge in the art. 

Throughout the description and claims of this specification, the word "comprise" and 
variations of the word, such as "comprising" and "comprises," means "including but not 
limited to," and is not intended to exclude, for example, other additives, components, integers 
or steps. 
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Those skilled in the art will recognize, or be able to ascertain using no more than 
routine experimentation, many equivalents to the specific embodiments of the method and 
compositions described herein. Such equivalents are intended to be encompassed by the 
following claims. 

Provided herein are new recombinant adeno-associated viruses (AAVs) designated 
AAV-X1, AAV-Xlb, AAV-X5, AAV-X19, AAV-X21, AAV-X22, AAV-X23, AAV-X24, 
AAV-X25, and AAV-X26. The term AAVX is used herein to refer generally to the new 
AAVs disclosed herein. Thus, AAVX can refer to one or more or all of AAV-X1, AAV-Xlb, 
AAV-X5, AAV-X19, AAV-X21, AAV-X22, AAV-X23, AAV-X24, AAV-X25, or AAV-X26. 
The application provides the isolation, subcloning, and sequencing of the disclosed AAVXs. 
These viruses have one or more of the characteristics described below. In one embodiment, 
the compositions provided herein do not include wild-type AAV. The methods provided 
herein can use either wild-type AAV or recombinant AAV-based delivery. Thus, in one 
embodiment, the methods provided herein do not use wild-type AAV. 

Provided herein are recombinant AAV-X1, AAV-Xlb, AAV-X5, AAV-X19, AAV- 
X21, AAV-X22, AAV-X23, AAV-X24, AAV-X25, and AAV-X26 particles, recombinant 
AAV-X1, AAV-Xlb, AAV-X5, AAV-X19, AAV-X21, AAV-X22, AAV-X23, AAV-X24, 
AAV-X25, and AAV-X26 vectors and recombinant AAV-X1, AAV-Xlb, AAV-X5, AAV- 
XI 9, AAV-X21, AAV-X22, AAV-X23, AAV-X24, AAV-X25, and AAV-X26 virions. As 
used herein, "recombinant" refers to nucleic acids, vectors, polypeptides, or proteins that have 
been generated using DNA recombination (cloning) methods and are distinguishable from 
native or wild-type nucleic acids, vectors, polypeptides, or proteins. An AAVX particle is a 
viral particle comprising an AAVX capsid protein. A recombinant AAVX vector is a nucleic 
acid construct that comprises at least one unique, isolated nucleic acid of AAVX. The 
recombinant AAVX vector can further comprise at least one non-AAVX nucleic acid. As used 
herein, a "virion" refers to an infectious virus particle, and "infectious" refers to the ability of a 
virion to deliver genetic material to a cell. Thus, a recombinant AAVX virion is a particle 
containing a recombinant AAVX vector, wherein the particle can be either an AAVX particle 
as described herein or a non-AAVX particle. Alternatively, a recombinant AAVX virion can 
be an AAVX particle containing a recombinant vector, wherein the vector can be either an 
AAVX vector as described herein or a non-AAVX vector. An AAVX particle can further be an 
"empty particle", wherein the particle does not contain a nucleic acid, vector or plasmid, and is 
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therefore not infectious. These vectors, particles, virions, nucleic acids and polypeptides are 
described below. 

Provided herein are nucleotide sequences of AAVX genomes and vectors and particles 
derived therefrom. Specifically provided is an AAVX nucleic acid vector. Thus, provided is a 
nucleic acid vector, comprising an AAVX-specific nucleic acid or a nucleic acid encoding an 
AAVX-specific protein. The AAVX-specific nucleic acid can be a pair of AAVX inverted 
terminal repeats (ITRs) or an AAVX promoter. The nucleic acid encoding an AAVX-specific 
protein can be an AAVX capsid protein or an AAVX Rep protein. Thus, the provided AAVX 
nucleic acid vector need only have an AAVX ITR, an AAVX promoter, an AAVX Rep or an 
AAVX capsid to be an AAVX nucleic acid vector. 

The AAV ITR functions as an origin of replication for packaging of recombinant AAV 
particles. The minimum sequence necessary for this activity is the TRS site where Rep cleaves 
in order to replicate the virus. Minor modifications in an ITR are contemplated and are those 
that will not interfere with the hairpin structure formed by the ITR as described herein and 
known in the art. Furthermore, to be considered within the term it must retain the Rep binding 
site described herein. For example, the D- region of the AAV2 ITR, a single stranded region of 
the ITR (the D+ region is the reverse complement of the D- region), inboard of the TRS site, 
has been shown to bind a factor, which, depending on its phosphorylation state, correlates with 
the conversion of the AAV2 from a single stranded genome to a transcriptionally active form 
that allows for expression of the viral DNA. For example, this region is conserved between 
AAV2, AAV3, AAV4, and AAV6 but is divergent in AAV5 and BAAV. Further, as disclosed 
herein, the TRS signal (e.g., aa 176-181 of SEQ ID NO:40) and Rep Binding site (e.g., aa 195- 
210 of SEQ ID NO:40) is conserved between AAV2 and AAV-X26. 

In one aspect, the AAVX nucleic acid vector comprises an AAVX ITR. Thus, the 
AAVX nucleic acid vector can comprise an AAV-X1, AAV-Xlb, AAV-X5, AAV-X19, AAV- 
X21, AAV-X22, AAV-X23, AAV-X24, AAV-X25, or AAV-X26 ITR. In another aspect, the 
AAVX nucleic acid vector comprises an ITR from any AAV serotype. Thus, the AAVX 
nucleic acid vector can comprise an AAV1, AAV2, AAV3, AAV4, AAV5, AAV6, AAV7, 
AAV8, AAV9, AAV10, AAV1 1, AAAV, or BAAV ITR. 

In one aspect, the AAVX nucleic acid vector comprises an AAVX promoter.. Thus, the 
promoter can be an AAVX p5, pi 9 or p40 promoter. Thus, the promoter can be an AAV-X1 
p5 promoter, an AAV-Xlb p5 promoter, an AAV-X5 p5 promoter, an AAV-X19 p5 promoter, 
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n AAV-X21 p5 promoter, an AAV-X22 p5 promoter, an AAV-X23 p5 promoter, an AAV-X24 
p5 promoter, an AAV-X25 p5 promoter, or an AAV-X26 p5 promoter. 

In another aspect, the promoter can be a promoter from any of the AAV serotypes. 
Thus, the promoter can be an AAV1 p5 promoter, an AAV2 p5 promoter, an AAV3 p5 
promoter, an AA V4 p5 promoter, AAV5 p5 promoter, an AAV6 p5 promoter, an AAAV p5 
promoter, a B AAV p5 promoter, or an AAVX p5 promoter. 

Furthermore, smaller fragments of an AAV p5 promoter that retain promoter activity 
can readily be determined by standard procedures including, for example, constructing a series 
of deletions in the p5 promoter, linking the deletion to a reporter gene, and determining 
whether the reporter gene is expressed, i.e., transcribed and/or translated. 

In yet another aspect, the promoter of the AAVX nucleic acid vector can be any desired 
promoter, selected by known considerations, such as the level of expression of a nucleic acid 
functionally linked to the promoter and the cell type in which the vector is to be used. That is, 
the promoter can be tissue/cell-specific. Promoters can be prokaryotic, eukaryotic, fungal, 
nuclear, mitochondrial, viral or plant promoters. Promoters can be exogenous or endogenous 
to the cell type being transduced by the vector. Promoters can include, for example, bacterial 
promoters, or known strong promoters such as S V40 or the inducible metallothionein 
promoter. Additionally, chimeric regulatory promoters for targeted gene expression can be 
utilized. Examples of these regulatory systems, which are known in the art, include the 
tetracycline based regulatory system which utilizes the tet transactivator protein (tTA), a 
chimeric protein containing the VP 16 activation domain fused to the tet repressor of 
Escherichia coli, the IPTG based regulatory system, the CDD based regulatory system, and the 
Ecdysone based regulatory system (No, D., et al, Proc Natl Acad Sci USA. 93(8):3346-3351 
(1996)). Other promoters include promoters derived from actin genes, immunoglobulin genes, 
cytomegalovirus (CMV), adenovirus, bovine papilloma virus, adenoviral promoters, such as 
the adenoviral major late promoter, an inducible heat shock promoter, respiratory syncytial 
virus, Rous sarcomas virus (RSV), etc. 

In one aspect, the AAVX nucleic acid vector comprises a nucleic acid encoding an 
AAVX Rep protein. Thus, the Rep protein can be an AAV-X1 Rep protein, an AAV-Xlb Rep 
protein, an AAV-X5 Rep protein, an AAV-X19 Rep protein, n AAV-X21 Rep protein, an 
AAV-X22 Rep protein, an AAV-X23 Rep protein, an AAV-X24 Rep protein, an AAV-X25 
Rep protein, or an AAV-X26 Rep protein. In another aspect, the AAVX nucleic acid vector 
comprises a nucleic acid encoding a Rep protein from any AAV serotype. Thus, the AAVX 
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nucleic acid vector can comprise a nucleic acid encoding an AAV1 Rep protein, an AAV2 Rep 
protein, an AAV3 Rep protein, an AAV4 Rep protein, an AAV5 Rep protein, an AAV6 Rep 
protein, an AAV7 Rep protein, an AAV8 Rep protein, an AAV9 Rep protein, an AAV 10 Rep 
protein, an AAV1 1 Rep protein, an AAAV Rep protein, or an BAAV Rep protein. For all 
AAV serotypes, the AAV Rep proteins can be selected from a group consisting of Rep78, 
Rep68, Rep52 and Rep40. 

The AAV-X1 Rep protein of an AAV-X1 nucleic acid vector can be encoded by a 
nucleic acid sequence comprising nucleotides 1-743, 744, 745, 746, 747, 748, 749, 750, 751, 
752, 752, 754, 755, 756, 757, 758, 759, 760, 761, 762, or 763 of SEQ ID NO: 1. The AAV-X1 
Rep protein can be encoded by a nucleic acid sequence of SEQ ID NO:48. The AAV-X1 Rep 
protein can comprise the amino acid sequence of SEQ ED NO:49. The AAV-Xlb Rep protein 
of an AAV -XI b nucleic acid vector can be encoded by a nucleic acid sequence comprising 
nucleotides 1-2016, 2017, 2018, 2019, 2020, 2021, 2022, 23 2024, 2025, 2026, 2027, 2028, 
2029, 2030, 203 1 , 2032, 2033, 2034, 2035, or 2036 of SEQ ID NO:2. The AAV-X5 Rep 
protein of an AAV-X5 nucleic acid vector can be encoded by a nucleic acid sequence 
comprising nucleotides 1-1926, 1927, 1928, 1929, 1930, 1931, 1932, 1933, 1934, 1935, 1936, 
1937, 1938, 1939, 1940, 1941, 1942, 1943, 1944, 1945, or 1946 of SEQ ID N03. The AAV- 
XI 9 Rep protein of an AAV-X19 nucleic acid vector can be encoded by a nucleic acid 
sequencexomprising nucleotides 1-743, 744, 745, 746, 747, 748, 749, 750, 751, 752,' 752, 754, 
755, 756, 757, 758, 759, 760, 761, 762, or 763 of SEQ ID NO:4. The AAV-X21 Rep protein of 
an AAV-X21 nucleic acid vector can be encoded by a nucleic acid sequence comprising 
nucleotides 1-752, 753, 754, 755, 756, 757, 758, 759. 760, 761, 762, 763, 764, 765, 766, 767, 
768, 769, 770, 771, or 772 of SEQ ID NO:5. The AAV-X22 Rep protein of an AAV-X22 
nucleic acid vector can be encoded by a nucleic acid sequence comprising nucleotides 1-748, 
749, 750, 751, 752, 753, 754, 755, 756, 757, 758, 759. 760, 761, 762, 763, 764, 765, 766, 767, 
or 768 of SEQ ID NO:6. The AAV-X23 Rep protein of an AAV-X23 nucleic acid vector can 
be encoded by a nucleic acid sequence comprising nucleotides 1-746, 747, 748, 749, 750, 751, 
752, 753, 754, 755, 756, 757, 758, 759. 760, 761, 762, 763, 764, 765, or 766 of SEQ ID 
NOS:7. The AAV-X24 Rep protein of an AAV-X24 nucleic acid vector can be encoded by a 
nucleic acid sequence comprising nucleotides 1-752, 753, 754, 755, 756, 757, 758, 759. 760, 
761, 762, 763, 764, 765, 766, 767, 768, 769, 770, 771, or 772 of SEQ ID NO:8. The AAV-X25 
Rep protein of an AAV-X25 nucleic acid vector can be encoded by the a nucleic acid sequence 
comprising nucleotides 1-974, 975, 976, 977, 978, 979, 980, 981, 982, 983, 984, 985, 986, 987, 
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988, 989, 990, 991, 992, 993, or 994 of SEQ ID NO:9. The AAV-X25 Rep protein can be 
encoded by a nucleic acid sequence of SEQ ID NO:50. The AAV-X25 Rep protein can 
comprise the amino acid sequence of SEQ ID NO:51. The AAV-X26 Rep protein of an AAV- 
X26 nucleic acid vector can be encoded by the a nucleic acid sequence comprising nucleotides 
1-789, 790, 791, 792, 793, 794, 795, 796, 797, 798, 799, 800, 801, 802, 803, 804, 805, 806, 
807, 808, or 809 of SEQ ID NO: 10. The AAV-X26 Rep protein can be encoded by a nucleic 
acid sequence of SEQ ID NO:52. The AAV-X26 Rep protein can comprise the amino acid 
sequence of SEQ ID NO:53. 

In one aspect, the AAVX nucleic acid vector comprises a nucleic acid encoding an 
AAVX capsid protein. Thus, the capsid protein can be an AAV-X1 capsid protein, an AAV- 
Xlb capsid protein, an AAV-X5 capsid protein, an AAV-X19 capsid protein, n AAV-X21 
capsid protein, an AAV-X22 capsid protein, an AAV-X23 capsid protein, an AAV-X24 capsid 
protein, an AAV-X25 capsid protein, or an AAV-X26 capsid protein. In another aspect, the 
AAVX nucleic acid vector comprises a nucleic acid encoding a capsid protein from any AAV 
serotype. Thus, the AAVX nucleic acid vector can comprise a nucleic acid encoding an AAV1 
capsid protein, an AAV2 capsid protein, an AAV3 capsid protein, an AAV4 capsid protein, an 
AAV5 capsid protein, an AAV6 capsid protein, an AAV7 capsid protein, an AAV8 capsid 
protein, an AAV9 capsid protein, an AAV 10 capsid protein, an AAV1 1 capsid protein, an 
AAAV capsid protein, or an BAAV capsid protein. For all AAV serotypes, the AAV capsid 
proteins can be selected from a group consisting of VP1, VP2 and VP3. 

As an example, the AAV -XI VP1 capsid protein of an AAV-X1 particle can have the 
amino acid sequence of SEQ ID NO:21. The AAV-Xlb VP1 capsid protein of an AAV -XI b 
particle can have the amino acid sequence of SEQ ID NOS:22. The AAV-X5 VP1 capsid 
protein of an AAV-X5 particle can have the amino acid sequence of SEQ ID NO:23. The 
AAV-X19 VP1 capsid protein of an AAV-X19 particle can have the amino acid sequence of 
SEQ ID NO:24. The AAV-X21 VP1 capsid protein of an AAV-X21 particle can have the 
amino acid sequence of SEQ ID NO:25. The AAV-X22 VP1 capsid protein of an AAV-X22 
, particle can have the amino acid sequence of SEQ ID NO:26. The AAV-X23 VP1 capsid 
protein of an AAV-X23 particle can have the amino acid sequence of SEQ ID NO:27. The 
AAV-X24 VP1 capsid protein of an AAV-X24 particle can have the amino acid sequence of 
SEQ ID NO:28. The AAV-X25 capsid protein of an AAV-X25 particle can have the amino 
acid sequence of SEQ ID NO:29. The AAV-X26 capsid protein of an AAV-X26 particle can 
have the amino acid sequence of SEQ ID NO:30. 
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The AAV -XI VP1 capsid protein of an AAV-X1 particle can be encoded by the nucleic 
acid sequence of SEQ ID NO:l 1 . The AAV-Xlb VP1 capsid protein of an AAV-Xlb particle 
can be encoded by the nucleic acid sequence of SEQ ID NO: 12. The AAV-X5 VP1 capsid 
protein of an AAV-X5 particle can be encoded by the nucleic acid sequence of SEQ ID NO: 13. 
The AAV-X19 VP1 capsid protein of an AAV-X19 particle can be encoded by the nucleic acid 
sequence of SEQ ID NO: 14. The AAV-X21 VP1 capsid protein of an AAV-X21 particle can 
be encoded by the nucleic acid sequence of SEQ ID NO: 15. The AAV-X22 VP1 capsid protein 
of an AAV-X22 particle can be encoded by the nucleic acid sequence of SEQ ID NO: 16. The 
AAV-X23 VP1 capsid protein of an AAV-X23 particle can be encoded by the nucleic acid 
sequence of SEQ ID NOS: 1 7. The AAV-X24 VP 1 capsid protein of an AAV-X24 particle can 
be encoded by the nucleic acid sequence of SEQ ID NO: 18. The AAV-X25 VP1 capsid protein 
of an AAV-X25 particle can be encoded by the nucleic acid sequence of SEQ ID NO: 19. The 
AAV-X26 VP1 capsid protein of an AAV-X26 particle can be encoded by the nucleic acid 
sequence of SEQ ID NO:20. 

It should be recognized that any errors in any of the nucleotide sequences disclosed 
herein can be corrected, for example, by using the hybridization procedure described below 
with various probes derived from the described sequences such that the coding sequence can be 
re-isolated and re-sequenced. Rapid screening for point mutations can also be achieved with 
the use of polymerase chain reaction-single strand conformation polymorphism (PCR-SSCP). 
The corresponding amino acid sequence can then be corrected accordingly. Also, since the 
disclosed AAV serotypes AAV-X1, AAV-Xlb, AAV-X5, AAV-X19, AAV-X21, AAV-X22, 
AAV-X23, AAV-X24, AAV-X25, and AAV-X26 are disclosed herein to be present in defined 
ATCC cultures, the whole virus sequence is provided by reference to the deposit. 

The AAVX-derived vector provided herein can further comprise an exogenous nucleic 
acid functionally linked to the promoter. By "exogenous" nucleic acid is meant any nucleic 
acid that is not normally found in wild-type AAVX that can be inserted into a vector for 
transfer into a cell, tissue or organism. The exogenous nucleic acid can be a nucleic acid not 
normally found in the target cell, or it can be an extra copy or copies of a nucleic acid normally 
found in the target cell. The terms "exogenous" and "heterologous" are used herein 
interchangeably. 

By "functionally linked" is meant that the promoter can promote expression of the 
exogenous nucleic acid, as is known in the art, and can include the appropriate orientation of 
the promoter relative to the exogenous nucleic acid. Furthermore, the exogenous nucleic acid 
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preferably has all appropriate sequences for expression of the nucleic acid. The nucleic acid 
can include, for example, expression control sequences, such as an enhancer, and necessary 
information processing sites, such as ribosome binding sites, RNA splice sites, polyadenylation 
sites, arid transcriptional terminator sequences. 

The exogenous nucleic acid can encode beneficial proteins or polypeptides that replace 
missing or defective proteins required by the cell or subject into which the vector is transferred 
or can encode a cytotoxic polypeptide that can be directed, e.g., to cancer cells or other cells 
whose death would be beneficial to the subject. The exogenous nucleic acid can also encode 
antisense RNAs that can bind to, and thereby inactivate, mRNAs made by the subject that 
encode harmful proteins. The exogenous nucleic acid can also encode ribozymes that can 
effect the sequence-specific inhibition of gene expression by the cleavage of mRNAs. In one 
aspect, antisense polynucleotides can be produced from an exogenous expression cassette in an 
AAV5 vector construct where the expression cassette contains a sequence that promotes cell- 
type specific expression (Wirak et al 9 EMBO 10:289 (1991)). For general methods relating to 
antisense polynucleotides, see Antisense RNA and DNA, D. A. Melton, Ed., Cold Spring 
Harbor Laboratory, Cold Spring Harbor, NY (1988). 

Examples of exogenous nucleic acids which can be administered to a cell or subject as 
part of the present AAVX vector can include, but are not limited to the following: nucleic acids 
encoding secretory and nonsecretory proteins, nucleic acids encoding therapeutic agents, such 
• as tumor necrosis factors (TNF), such as TNF-a; interferons, such as interferon-a, interferon-/?, 
and interferon-^ interleukins, such as IL-1, IL-1& and ILs -2 through -14; GM-CSF; adenosine 
deaminase; cellular growth factors, such as lymphokines; soluble CD4; Factor VIII; Factor IX; 
T-cell receptors; LDL receptor; ApoE; ApoC; alpha- 1 antitrypsin; ornithine transcarbamylase 
(OTC); cystic fibrosis transmembrane receptor (CFTR); insulin; Fc receptors for antigen 
binding domains of antibodies, such as immunoglobulins; anti-HIV decoy tar elements; and 
antisense sequences which inhibit viral replication, such as antisense sequences which inhibit 
replication of hepatitis B or hepatitis non-A, non-B virus. The nucleic acid is chosen 
considering several factors, including the cell to be transfected. Where the target cell is a 
blood cell, for example, particularly useful nucleic acids to use are those which allow the blood 
cells to exert a therapeutic effect, such as a gene encoding a clotting factor for use in treatment 
of hemophilia. Another target cell is the lung airway cell, which can be used to administer 
nucleic acids, such as those coding for the cystic fibrosis transmembrane receptor, which could 
provide a gene therapeutic treatment for cystic fibrosis. Other target cells include muscle cells 
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where useful nucleic acids, such as those encoding cytokines and growth factors, can be 
transduced and the protein the nucleic acid encodes can be expressed and secreted to exert its 
effects on other cells, tissues and organs, such as the liver. Furthermore, the nucleic acid can 
encode more than one gene product, limited only, if the nucleic acid is to be packaged in a 
capsid, by the size of nucleic acid that can be packaged. 

Furthermore, suitable nucleic acids can include those that, when transferred into a 
primary cell, such as a blood cell, cause the transferred cell to target a site in the body where 
that cell's presence would be beneficial. For example, blood cells such as TIL cells can be 
modified, such as by transfer into the cell of a Fab portion of a monoclonal antibody, to 
recognize a selected antigen. Another example would be to introduce a nucleic acid that would 
target a therapeutic blood cell to tumor cells. Nucleic acids useful in treating cancer cells 
include those encoding chemotactic factors which cause an inflammatory response at a specific 
site, thereby having a therapeutic effect. 

Cells, particularly blood cells, muscle cells, airway epithelial cells, brain cells and 
endothelial cells having such nucleic acids transferred into them can be useful in a variety of 
diseases, syndromes and conditions. For example, suitable nucleic acids include nucleic acids 
encoding soluble CD4, used in the treatment of ADDS and a-antitrypsin, used in the treatment 
of emphysema caused by a-antitrypsin deficiency. Other diseases, syndromes and conditions 
in which such cells can be useful include, for example, adenosine deaminase deficiency, sickle 
cell deficiency, brain disorders such as Alzheimer's disease, thalassemia, hemophilia, diabetes, 
phenylketonuria, growth disorders and heart diseases, such as those caused by alterations in 
cholesterol metabolism, and defects of the immune system. 

Other cells in which a gene of interest can be expressed include, but are not limited to, 
fibroblasts, neurons, retinal cells, kidney cells, lung cells, bone marrow stem cells, 
hematopoietic stem cells, retinal cells and neurons. The cells in which the gene of interest can 
be expressed can be dividing cells such as MDCK cells, BHK cells, HeLa cells, 3T3 cells, CV1 
cells, COS7 cells, HOS cells and 293 cells. The cells can also be embryonic stem cells of 
mouse, rhesus, human, bovine or sheep origin, as well as stem cells of neural, hematopoietic, 
muscle, cardiac, immune or other origin. Non-dividing cells can also be contacted with a 
particle provided herein to express a gene of interest. Such cells include, but are not limited to 
hematopoietic stem cells and embryonic stem cells that have been rendered non-dividing. 

As another example, hepatocytes can be transfected with the present vectors having 
useful nucleic acids to treat liver disease. For example, a nucleic acid encoding OTC can be 
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used to transfect hepatocytes {ex vivo and returned to the liver or in vivo) to treat congenital 
hyperammonemia, caused by an inherited deficiency in OTC. Another example is to use a 
nucleic acid encoding LDL to target hepatocytes ex vivo or in vivo to treat inherited LDL 
receptor deficiency. Such transfected hepatocytes can also be used to treat acquired infectious 
diseases, such as diseases resulting from a viral infection. For example, transduced hepatocyte 
precursors can be used to treat viral hepatitis, such as hepatitis B and non-A, non-B hepatitis, 
for example by transducing the hepatocyte precursor with a nucleic acid encoding an antisense 
RNA that inhibits viral replication. Another example includes transferring a vector provided 
herein having a nucleic acid encoding a protein, such as y-interferon, which can confer 
resistance to the hepatitis virus. 

For a procedure using transfected hepatocytes or hepatocyte precursors, hepatocyte 
precursors having a vector provided herein transferred in can be grown in tissue culture, 
removed from the tissue culture vessel, and introduced to the body, such as by a surgical 
method. In this example, the tissue would be placed directly into the liver, or into the body 
cavity in proximity to the liver, as in a transplant or graft. Alternatively, the cells can simply 
be directly injected into the liver, into the portal circulatory system, or into the spleen, from 
which the cells can be transported to the liver via the circulatory system. Furthermore, the 
cells can be attached to a support, such as microcarrier beads, which can then be introduced, 
such as by injection, into the peritoneal cavity. Once the cells are in the liver, by whatever 
means, the cells can then express the nucleic acid and/or differentiate into mature hepatocytes 
which can express the nucleic acid. 

The provided viral particles can be administered to cells, as described herein, with a 
Multiplicity of Infection (MOI) of 10. The MOI is the ratio of infectious virus particles to the 
number of cells being infected. Thus, an MOI of 0.1 results in the average inoculation of 1 
virus particle for every 10 cells. The general theory behind MOI is to introduce one infectious 
virus particle to every host cell that is present in the culture. However, more than one virus 
may infect the same cell which leaves a percentage of cells uninfected. This occurrence can be 
reduced by using a higher MOI to ensure that every cell is infected. The provided viral 
particles can therefore be administered to cells, as described herein, with a MOI of 0.01 to 100, 
such as for example 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 30, 40, 50, 
60, 70, 80, 90, 100. 

The AAVX-derived vector can include any normally occurring AAVX nucleic acid 
sequences. The AAVX-derived vector can also include sequences that are at least 80%, 81%, 
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82%, 83%, 84%, 85%, 86%, 87%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 

98.5%, 99%, 99.5% or 99.9% identical lo the AAVX nucleic acids set forth herein. Examples 

of vector constructs are provided below. 

The present AAVX vector or AAVX particle or recombinant AAVX virion can utilize 

any unique nucleic acid fragment of the AAVX disclosed herein, including the AAVX nucleic 

acids set forth in SEQ ID NOS:1-20. A unique fragment consists of a sequence that is not 

present anywhere else on a genome. A fragment is a subpart of the reference sequence, and 

thus is identical in sequence to the region of the parent nucleic acid of which it is a fragment. 

To be unique, the fragment must be of sufficient size to distinguish it from other known 

sequences, which is most readily determined by comparing any nucleic acid fragment to the 

nucleotide sequences of nucleic acids in computer databases, such as GenBank. Such 

comparative searches are standard in the art. Typically, a unique fragment useful as a primer 

or probe will be at least about 8 or 10, preferable at least 20 or 25 nucleotides in length, 

depending upon the specific nucleotide content of the sequence. Additionally, fragments can 

be, for example, at least about 30, 40, 50, 75, 100, 200 or 500 nucleotides in length and can 

encode polypeptides or be probes. The nucleic acid can be single or double stranded, 

depending upon the purpose for which it is intended. Where desired, the nucleic acid can be 

RNA. 

It is understood that as discussed herein the use of the terms "homology" and "identity" 
mean the same thing as similarity. Thus, for example, if the use of the word homology is used 
to refer to two non-natural sequences, it is understood that this is not necessarily indicating an 
evolutionary relationship between these two sequences, but rather is looking at the similarity or 
relatedness between their nucleic acid sequences. Many of the methods for determining 
homology between two evolutionarily related molecules are routinely applied to any two or 
more nucleic acids or proteins for the purpose of measuring sequence similarity regardless of 
whether they are evolutionarily related. 

In general, it is understood that one way to define any known variants and derivatives 
or those that might arise, of the disclosed nucleic acids and polypeptides herein, is through 
defining the variants and derivatives in terms of homology to specific known sequences. In 
general, variants of nucleic acids and polypeptides herein disclosed typically have at least, 
about 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 
96%, 97%, 98%, 98.5%, 99%, 99.5% or 99.9% homology to the stated sequence or the native 
sequence. Those of skill in the art readily understand how to determine the homology of two 
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polypeptides or nucleic acids. For example, the homology can be calculated after aligning the 
two sequences so that the homology is at its highest level. 

Another way of calculating homology can be performed by published algorithms. 
Optimal alignment of sequences for comparison may be conducted by the local homology 
algorithm of Smith and Waterman Adv. Appl. Math. 2: 482 (1981), by the homology 
alignment algorithm of Needleman and Wunsch, J. MoL Biol. 48: 443 (1970), by the search 
for similarity method of Pearson and Lipman, Proc. Natl. Acad. Sci. U.S.A. 85: 2444 (1988), 
by computerized implementations of these algorithms (GAP, BESTFIT, FAST A, and TFASTA 
in the Wisconsin Genetics Software Package, Genetics Computer Group, 575 Science Dr., 
Madison, WI; the BLAST algorithm of Tatusova and Madden FEMS Microbiol. Lett. 174: 
247-250 (1999) available from the National Center for Biotechnology Information 
( http://www.ncbi.nlm.nih.gov/blast/bl2seq/bl2.htmn , or by inspection. 

The same types of homology can be obtained for nucleic acids by for example the 
algorithms disclosed in Zuker, M. Science 244:48-52, 1989, Jaeger et al. Proc. Natl. Acad. Sci. 
USA 86:7706-7710, 1989, Jaeger et al. Methods Enzymol. 183:281-306, 1989, which are 
herein incorporated by reference for at least material related to nucleic acid alignment. It is 
understood that any of the methods typically can be used and that in certain instances the 
results of these various methods may differ, but the skilled artisan understands if identity is 
found with at least one of these methods, the sequences would be said to have the stated 
identity. 

For example, as used herein, a sequence recited as having a particular percent 
homology to another sequence refers to sequences that have the recited homology as calculated 
by any one or more of the calculation methods described above. For example, a first sequence 
has 80 percent homology, as defined herein, to a second sequence if the first sequence is 
calculated to have 80 percent homology to the second sequence using the Zuker calculation 
method even if the first sequence does not have 80 percent homology to the second sequence 
as calculated by any of the other calculation methods. As another example, a first sequence 
has 80 percent homology, as defined herein, to a second sequence if the first sequence is 
calculated to have 80 percent homology to the second sequence using both the Zuker 
calculation method and the Pearson and Lipman calculation method even if the first sequence 
does not have 80 percent homology to the second sequence as calculated by the Smith and 
Waterman calculation method, the Needleman and Wunsch calculation method, the Jaeger 
calculation methods, or any of the other calculation methods. As yet another example, a first 
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sequence has 80 percent homology, as defined herein, to a second sequence if the first 
sequence is calculated to have 80 percent homology to the second sequence using each of 
calculation methods (although, in practice, the different calculation methods will often result in 
different calculated homology percentages). 

Further provided herein is an AAVX capsid protein that can combine with other capsid 
proteins to form an AAVX particle to contain the disclosed vectors. Also provided herein is an 
AAVX particle, comprising an AAVX capsid protein. The capsid protein can be selected from 
a group consisting of VP1, VP2 and VP3. 

The AAV-X1 VP1 capsid protein of an AAV-X1 particle can have the amino acid 
sequence of SEQ ID NO:21. The AAV-Xlb VP1 capsid protein of an AAV-Xlb particle can 
have the amino acid sequence of SEQ ID NO:22. The AAV-X5 VP1 capsid protein of an 
AAV-X5 particle can have the amino acid sequence of SEQ ID NO:23. The AAV-X19 VP1 
capsid protein of an AAV-X19 particle can have the amino acid sequence of SEQ ID NO:24. 
The AAV-X21 VP1 capsid protein of an AAV-X21 particle can have the amino acid sequence 
of SEQ ID NO:25. The AAV-X22 VP1 capsid protein of an AAV-X22 particle can have the 
amino acid sequence of SEQ ID NO:26. The AAV-X23 VP1 capsid protein of an AAV-X23 
particle can have the amino acid sequence of SEQ ID NO:27. The AAV-X24 VP1 capsid 
protein of an AAV-X24 particle can have the amino acid sequence of SEQ ID NO:28. The 
AAV-X25 capsid protein of an AAV-X25 particle can have the amino acid sequence of SEQ 
ID NO:29. The AAV-X26 capsid protein of an AAV-X26 particle can have the amino acid 
sequence of SEQ ID NO:30. 

The AAV-X1 VP1 capsid protein of an AAV-X1 particle can be encoded by the nucleic 
acid sequence of SEQ ID NO:l 1. The AAV-Xlb VP1 capsid protein of an AAV-Xlb particle 
can be encoded by the nucleic acid sequence of SEQ ID NO: 12. The AAV-X5 VP1 capsid 
protein of an AAV-X5 particle can be encoded by the nucleic acid sequence of SEQ ID NO: 13. 
The AAV-X19 VP1 capsid protein of an AAV-X1 9 particle can be encoded by the nucleic acid 
sequence of SEQ ID NO:14. The AAV-X21 VP1 capsid protein of an AAV-X21 particle can 
be encoded by the nucleic acid sequence of SEQ ID NO: 15. The AAV-X22 VP1 capsid protein 
of an AAV-X22 particle can be encoded by the nucleic acid sequence of SEQ ID NO: 16. The 
AAV-X23 VP1 capsid protein of an AAV-X23 particle can be encoded by the nucleic acid 
sequence of SEQ ID NO: 17. The AAV-X24 VP1 capsid protein of an AAV-X24 particle can 
be encoded by the nucleic acid sequence of SEQ ID NO: 18. The AAV-X25 VP1 capsid protein 
of an AAV-X25 particle can be encoded by the nucleic acid sequence of SEQ ID NO: 19. The 
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AAV-X26 VP1 capsid protein of an AAV-X26 particle can be encoded by the nucleic acid 
sequence of SEQ ID NO:20. 

For example, provided is an AAVX particle, comprising all three AAVX capsid 
proteins, i.e., VP1 , VP2 and VP3. Also provided is an AAVX particle, comprising each AAVX 
capsid protein individually or in combination. Also provided is an AAVX particle comprising 
VP1 and VP3 capsid proteins, i.e., lacking any VP2 capsid proteins. Thus, an AAVX particle 
comprising an AAVX capsid protein comprises at least one AAVX capsid protein (VP1, VP2 
or VP3) or a functional fragment thereof. One of skill in the art understands that it is the non- 
conserved amino acids that are contributing to the properties of AAVX that make it distinct 
from the other serotypes. Provided therefore is a capsid protein comprising a mutation, deletion 
or substitution in the conserved regions, including, for example, a substitution with a 
homologous region from another AAV serotype.. 

An AAVX particle comprising an AAVX capsid protein can be utilized to deliver a 
nucleic acid vector to a cell, tissue or subject. For example, the herein described AAVX 
vectors can be encapsidated in an AAVX capsid-derived particle and utilized in a gene delivery 
method. Furthermore, other viral nucleic acids can be encapsidated in the AAVX particle and 
utilized in such delivery methods. For example, an AAV1-11, AAAV, or BAAV vector (e.g. 
AAV 1-1 1, AAAV, BAAV or AAVX ITR and nucleic acid of interest) can be encapsidated in 
an AAVX particle and administered. Furthermore, an AAVX chimeric capsid incorporating 
AAV1-1 1, AAAV, BAAV or AAVX capsid sequences and a different AAVX capsid 
sequences can be generated, by standard cloning methods, selecting regions from the known 
sequences of each protein as desired. For example, particularly antigenic regions of the AAVX 
capsid protein can be replaced with the corresponding region of the AAV2 capsid protein. In 
addition to chimeric capsids incorporating AAV2 capsid sequences, chimeric capsids 
incorporating AAV1, 3-8, AAAV, BAAV or AAVX capsid sequences can be generated, by 
standard cloning methods, selecting regions from the known sequences of each protein as 
desired. Alternatively a chimeric capsid can be made by the addition of a plasmid that 
expresses AAV1-1 1, AAAV, BAAV or AAVX capsid proteins at a ratio with the AAVX 
capsid expression plasmid that allows only a few capsid proteins to be incorporated into the 
AAVX particle. Thus, for example, a chimeric particle may be constructed that contains 6 
AAV2 capsid proteins and 54 AAVX capsid proteins if the complete capsid contains 60 capsid 
proteins. Methods for generating chimeric AAVs are known in the art and can be found in 
Rabinowitz JE, et al. J Virol. 2004 May;78(9):4421-32, herein incorporated by reference for 
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these methods. Examples of chimeric capsids would be to combine the VP1, 2, 3 proteins of 
AAVX and the VP1, 2, 3 proteins of AAV5 such that a new tropism would arise. The capsids 
can also be modified to alter their specific tropism by genetically altering the capsid to encode 
a specific ligand to a cell surface receptor. 

Alternatively, the capsid can be chemically modified by conjugating a ligand to a cell 
surface receptor. By genetically or chemically altering the capsids, the tropism can be 
modified to direct AAVX to a particular cell or population of cells. The capsids can also be 
altered immunologically by conjugating the capsid to an antibody that recognizes a specific 
protein on the target cell or population of cells. 

Provided are three regions in the capsid of AAVX that are on the virus surface and 
could tolerate substitution. These three regions in AAV-X1, AAV-Xlb, AAV-X19, AAV- 
X21, AAV-X22, AAV-X23, AAV-X24, and AAV-X25 are aa 261-271, aa 450-476, and aa 
546-559. These three regions in AAV-X5 are aa 259-268, aa 448-473, and aa 543-554. These 
three regions in AAV-X26 are aa 260-274, aa 445-477, and aa 550-565. Thus, provided is an 
AAVX VP1 capsid, comprising amino acid substitutions in aa 261-271, aa 450-476, or aa 546- 
559 of SEQ ID NOS:21, 22, 24, 25, 26, 27, 28, or 29. Thus, also provided is an AAVX VP1 
capsid, comprising amino acid substitutions in aa 259-268, aa 448-473, and aa 543-554 of SEQ 
ID NO:23. Thus, also provided is an AAVX VP1 capsid, comprising amino acid substitutions 
in aa 260-274, aa 445-477, and aa 550-565 of SEQ ID NO:30. 

Other regions of the AAVX capsid could also accommodate the substitution of amino 
acids that would allow for epitope presentation on the surface of the virus. All of these regions 
would have surface exposure and the ability to support a substitution of sequence to insert the 
epitope while still allowing for capsid assembly. The substitutions can include non-AAVX 
epitopes and non-AAVX ligands. 

Because of the symmetry of the AAV particles, a substitution in one subunit of the 
capsid will appear multiple times on the capsid surface. For example the capsid is made of 
approximately 50 VP3 proteins, 5 VP 1 and 5 VP2. Therefore an epitope incorporated in the 
VP3 protein could be expressed 55 times on the surface of each particle increasing the 
likelihood of the epitope forming a stable interaction with its target. In some cases this may be 
too high of a ligand density for functional binding or this high density of epitope may interfere 
with capsid formation. The epitope density could be lowered by introducing another plasmid 
into the packaging system for production of recombinant particles and the ratio between the 
packaging plasmid with the modified VP3 protein and the wt VP3 protein altered to balance 
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the epitope density on the virus surface. Thus, the ratio between the modified VP3 and the wt 
VP3 can be 0:50 to 50:0, including, for example, 1 :49, 2:48, 3:47, 4:46, 5:45, 6:44, 7:43, 8:42, 
9:41, 10:40, 11:39, 12:38, 13:37, 14:36, 15:35, 16:34, 17:33, 18:32, 19:31,20:30,21:29, 22:28, 
23:27, 24:27, 25:25, 26:24, 27:23, 28:22, 29:21, 30:20, 31:19, 32:18, 33:17, 34:16, 35:15, 
36:14, 37:13, 38:12, 39:1 1, 40:10, 41 :9, 42:8, 43:7, 44:6, 45:5, 46:4, 47:3, 48:2, or 49:1. 

Epitopes could be incorporated into the virus capsid for the purpose of 1) altering the 
tropism of the virus 2) blocking an immune response directed at the virus 3) developing a host 
immune response to the epitope for the purpose of vaccination. 

Examples of epitopes that could be added to AAVX capsids include but are not limited to: 

LH receptor binding epitope 

RGD integrin binding epitope 

CD13 binding epitope NGRAHA SEQ ID NO:35 

The Retanef polyprotein vaccine candidate for HIV-1 

single chain antibody fragments directed against tumor cells 

Endothelial cell binding epitope SIGYPLP SEQ ID NO:36 

serpin receptor ligand, KFNKPFVFLI SEQ ID NO:37 

protective B-cell epitope hemagglutinin (HA) 91-108 from influenza HA 

NDV B-cell immunodominant epitope (IDE) spanning residues 447 to 455 

Major immunogenic epitope for parvovirus B19 (NISLDNPLENPSSLFDLVARIK SEQ ID 

NO:38) that can elicit protective antibody titers. 

The capsids can also be assembled into empty particles by expression in mammalian, 
bacterial, fungal or insect cells. For example, AAV2 particles are known to be made from VP3 
and VP2 capsid proteins in baculovirus. The same basic protocol can produce an empty 
AAVX particle comprising AAVX capsid proteins and also full particles. The empty AAVX 
particles can be used to deliver, for example, antigens, drugs, proteins, or metals to cells or 
cells in a subject. Antigens can be directly incorporated into the capsid of an empty AAVX 
particle. An antigen can further be coupled via an antibody-antigen complex to the empty 
particle. Also disclosed is the coupling of drugs, proteins, or metals on the inside of the empty 
particles. 

The herein described recombinant AAVX nucleic acid derived vector can be 
encapsidated in a viral particle. The viral particle can be a parvovirus particle. The parvovirus 
particle can be a dependovirus particle. The viral particle can be an AAV particle. In 
particular, the recombinant AAVX nucleic acid derived vector can be encapsidated in an 
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AAV1, AAV2, AAV3, AAV4, AAV5, AAV6, AAV7, AAV8, AAV9, AAV 10, AAV11, 
AAAV, BAAV, AAV-X1, AAV-Xlb, AAV-X5, AAV-X19, AAV-X21, AAV-X22, AAV- 
X23, AAV-X24, AAV-X25, or AAV-X26 particle, a particle comprising a portion of any of 
these capsids, or a chimeric capsid particle as described above, by standard methods using the 
appropriate capsid proteins in the encapsidation process, as long as the nucleic acid vector fits 
within the size limitation of the particle utilized. The encapsidation process itself is standard in 
the art. The AAVX replication machinery, i.e. the rep initiator proteins and other functions 
required for replication, can be utilized to produce the AAVX genome that can be packaged in 
an AAV1, AAV2, AAV3, AAV4, AAV5, AAV6, AAV7, AAV8, AAV9, AAV10, AAV11, 
AAAV, BAAV, AAV-X1, AAV-Xlb, AAV-X5, AAV-X19, AAV-X21, AAV-X22, AAV- 
X23, AAV-X24, AAV-X25, or AAV-X26 capsid. 

The recombinant AAVX virion containing a vector can also be produced by 
recombinant methods utilizing multiple plasmids. In one example, the AAV1, AAV2, AAV3, 
AAV4, AAV5, AAV6, AAV7, AAV8, AAV9, AAV10, AAVl 1, AAAV, BAAV, AAV-X1, 
AAV-Xlb, AAV-X5, AAV-X19, AAV-X21, AAV-X22, AAV-X23, AAV-X24, AAV-X25 , or 
AAV-X26 rep nucleic acid would be cloned into one plasmid, the AAV1, AAV2, AAV3, 
AAV4, AAV5, AAV6, AAV7, AAV8, AAV9, AAV10, AAV1 1, AAAV, BAAV, AAV-X1, 
AAV-Xlb, AAV-X5, AAV-X19, AAV-X21, AAV-X22, AAV-X23, AAV-X24, AAV-X25, or 
AAV-X26 ITR nucleic acid would be cloned into another plasmid and the AAVl', AAV2, 
AAV3, AAV4, AAV5, AAV6, AAV7, AAV8, AAV9, AAV10, AAVl 1, AAAV, BAAV, 
AAV-X1, AAV-Xlb, AAV-X5, AAV-X19, AAV-X21, AAV-X22, AAV-X23, AAV-X24, 
AAV-X25, or AAV-X26 capsid nucleic acid would be cloned on another plasmid. These 
plasmids would then be introduced into cells. The cells that were efficiently transduced by all 
three plasmids, would exhibit specific integration as well as the ability to produce AAVX 
recombinant virus. Additionally, two plasmids could be used where the AAVl, AAV2, 
AAV3, AAV4, AAV5, AAV6, AAV7, AAV8, AAV9, AAV10, AAVl 1, AAAV, BAAV, 
AAV-X1, AAV-Xlb, AAV-X5, AAV-X19, AAV-X21, AAV-X22, AAV-X23, AAV-X24, 
AAV-X25, or AAV-X26 rep nucleic acid would be cloned into one plasmid and the AAVl, 
AAV2, AAV3, AAV4, AAV5, AAV6, AAV7, AAV8, AAV9, AAV10, AAVl 1, AAAV, 
BAAV, AAV-X1, AAV-Xlb, AAV-X5, AAV-X19, AAV-X21, AAV-X22, AAV-X23, AAV- 
X24, AAV-X25, or AAV-X26 ITR and AAVl, AAV2, AAV3, AAV4, AAV5, AAV6, AAV7, 
AAV8, AAV9, AAV10, AAV11, AAAV, BAAV, AAV-X1, AAV-Xlb, AAV-X5, AAV-X19, 
AAV-X21, AAV-X22, AAV-X23, AAV-X24, AAV-X25, or AAV-X26 capsid would be 
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cloned into another plasmid. These plasmids would then be introduced into cells. The cells 
that were efficiently transduced by both plasmids, would exhibit specific integration as well as 
the ability to produce AAVX recombinant virus. 

An AAV-X1, AAV-Xlb, AAV-X19, AAV-X21, AAV-X22, AAV-X23, AAV-X24, or 
AAV-X25 capsid can have about 98%, 98.5%, 99%, 99.1%, 99.2%, 99.3%, 99.4%, 99.5%, 
99.6%, 99.7%, 99.8%o, or 99.9% homology to the polypeptide having the amino acid sequence 
encoded by nucleotides in SEQ IDNOS:ll, 12, 14, 15, 16, 17, 18, or 19, respectively. An 
AAV-X5 capsid can have about 93%, 94%, 95%, 96%, 97%, 98%, 98.5%, or 99% homology 
to the polypeptide having the amino acid sequence encoded by nucleotides in SEQ ID NO: 13. 
An AAV-X26 capsid can have about 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 
93%, 94%, 95%, 96%, 97%, 98%, 98.5%, or 99% homology to the polypeptide having the 
amino acid sequence encoded by nucleotides in SEQ ID NO:20. 

An AAV-X1 , AAV-Xlb, AAV-X19, AAV-X21, AAV-X22, AAV-X23, AAV-X24, or 
AAV-X25 capsid protein can have about 98%, 98.5%, 99%, 99.1%, 99.2%, 99.3%, 99.4%, 
99.5%, 99.6%, 99.7%, 99.8%, or 99.9% homology to the protein having the amino acid 
sequence encoded by the nucleotides set forth in SEQ ED NOS:21, 22, 24, 25, 26, 27, 28, or 29, 
respectively. An AAV-X5 capsid protein can have about 93%, 94%, 95%, 96%, 97%, 98%, 
98.5%, or 99% homology to the protein having the amino acid sequence encoded by the 
nucleotides set forth in SEQ ID NO:23. An AAV-X26 capsid protein can have about 83%, 
84%, 85%>, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 98.5%, 
or 99% homology to the protein having the amino acid sequence encoded by the nucleotides 
set forth in SEQ ID NO:30. 

The percent homology used to identify proteins herein, can be based on a nucleotide- 
by-nucleotide comparison or more preferable is based on a computerized algorithm as 
described herein. Variations in the amino acid sequence of the AAVX capsid protein are 
contemplated herein, as long as the resulting particle comprising an AAVX capsid protein 
remains antigenically or immunologically distinct from AAV1-1 1, AAAV, or BAAV capsid, 
as can be routinely determined by standard methods. Specifically, for example, ELISA and 
Western blots can be used to determine whether a viral particle is antigenically or 
immunologically distinct from AAV2 or the other serotypes. Furthermore, the AAVX particle 
preferably retains tissue tropism distinction from other AAVs, such as that exemplified in the 
examples herein. An AAVX chimeric particle comprising at least one AAVX coat protein 
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may have a different tissue tropism from that of an AAVX particle consisting only of AAV X 
coat proteins, but is still distinct from the tropism of an AAV2 particle. 

Provided herein is a recombinant AAVX virion, comprising an AAVX particle 
containing, i.e., encapsidating, a vector comprising a pair of AAV2, AAV3, AAV4, AAV5, 
AAV6, AAV7, AAV8, AAV9, AAV10, AAV1 1, AAAV, BAAV, AAV-X1, AAV-Xlb, AAV- 
X5, AAV-X19, AAV-X21, AAV-X22, AAV-X23, AAV-X24, AAV-X25, or AAV-X26 
inverted terminal repeats. The recombinant vector can further comprise an AAV1, AAV2, 
AAV3, AAV4, AAV5, AAV6, AAV7, AAV8, AAV9, AAV10, AAV1 1, AAAV, BAAV, 
AAV-X1, AAV-Xlb, AAV-X5, AAV-X19, AAV-X21, AAV-X22, AAV-X23, AAV-X24, 
AAV-X25, or AAV-X26 Rep-encoding nucleic acid. The vector encapsidated in the particle 
can further comprise an exogenous nucleic acid inserted between the inverted terminal repeats. 

Further contemplated are chimeric recombinant ITRs that contain a rep binding site and 
a TRS site recognized by that Rep protein. By "Rep protein" is meant one or more of the Rep 
proteins, Rep 40, Rep 78, Rep 52, Rep 68. Alternatively, "Rep protein" could be all four of the 
Rep proteins described herein. 

. Examples of the combinations of ITR, Rep protein and Capsids that will produce 
recombinant virus include but are not limited to: 
AAVX ITR + AAVX Rep + AAVX Cap=virus 
AAV5 ITR + AAVX Rep + AAVX Cap=virus 
AAV5 ITR + AAVX Rep + AAV1 Cap=virus 
AAV5 ITR + AAVX Rep + AAV2 Cap=virus 
AAV5 ITR + AAVX Rep + AAV3 Cap=virus 
AAV5 ITR + AAVX Rep + AAV4 Cap=virus 
AAV5 ITR + AAVX Rep + AAV5 Cap=virus 
AAV5 ITR + AAVX Rep + AAV6 Cap=virus 
AAV5 ITR + AAVX Rep + AAV7 Cap=virus 
AAV5 ITR + AAVX Rep + AAV8 Cap=virus 
AAV5 ITR + AAVX Rep + AAV9 Cap=virus 
AAV5 ITR + AAVX Rep + AAV1 0 Cap=virus 
AAV5 ITR + AAVX Rep + AAV1 1 Cap=virus 
AAV5 ITR + AAVX Rep + AAAV Cap=virus 
AAV5 ITR + AAVX Rep + BAAV Cap=virus 
AAVX ITR + AAV5 Rep + AAVX Cap=virus 
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AAVX ITR + AAV5 Rep + AAV1 Cap=virus 
AAVX ITR + AAV5 Rep + AAV2 Cap=virus 
AAVX ITR + AAV5 Rep +' AAV3 Cap=virus 
AAVX ITR + AAV5 Rep + AAV4 Cap=virus 
AAVX ITR + AAV5 Rep + AAV5 Cap=virus 
AAVX ITR + AAV5 Rep + AAV6 Cap=virus 
AAVX ITR + AAV5 Rep + AAV7 Cap=virus 
AAVX ITR + AAV5 Rep + AAV8 Cap=virus 
AAVX ITR + AAV5 Rep + AAV9 Cap=virus 
AAVX ITR + AAV5 Rep + AAV 10 Cap=virus 
AAVX ITR + AAV5 Rep + AAV1 1 Cap=virus 
AAVX ITR + AAV5 Rep + AAAV Cap=virus 
AAVX ITR + AAV5 Rep + BAAV Cap=virus 
AAV1 ITR + AAV1 Rep + AAVX Cap=virus 
AAV2 ITR + AAV2 Rep + AAVX Cap=virus 
AAV3 ITR + AAV3 Rep + AAVX Cap=virus 
AAV4 ITR + AAV4 Rep + AAVX Cap=virus 
AAV5 ITR + AAV5 Rep + AAVX Cap=vims 
AAV6 ITR + AAV6 Rep + AAVX Cap=virus 
AAV7 ITR + AAV7 Rep + AAVX Cap=virus 
AAV8 ITR + AAV8 Rep + AAVX Cap=virus 
AAV9 ITR + AAV9 Rep + AAVX Cap=virus 
AAV10 ITR + AAV10 Rep + AAVX Cap=virus 
AAV1 1 ITR + AAV1 1 Rep + AAVX Cap=virus 
AAAV ITR + AAAV Rep + AAVX Cap=virus 
BAAV ITR + BAAV Rep + AAVX Cap=virus 

[Note that AAVX can be AAV-X1, AAV-Xlb, AAV-X5, AAV-X19, AAV-X21, AAV-X22, 
AAV-X23, AAV-X24, AAV-X25, or AAV-X26] 

One of skill in the art would know how to employ standard techniques to obtain the 
sequences from any of AAV1-1 1, AAAV, BAAV or AAVX in order to combine them with 
AAVX sequences. Examples of AAVX sequences that can be utilized in these constructs can 
be found herein. Examples of AAV1 sequences that can be utilized in these constructs can be 
found in GenBank under Accession No. AP063497 and these sequences are hereby 
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incorporated in their entireties by this reference. Examples of A A V2 sequences that can be 
utilized in these constructs can be found in GenBank under Accession No. AF043303 and 
these sequences are hereby incorporated in their entireties by this reference. Examples of 
AAV3 sequences that can be utilized in these constructs can be found in GenBank under 
Accession No. NC_001729 and these sequences are hereby incorporated in their entireties by 
this reference. Examples of AAV4 sequences that can be utilized in these constructs can be 
found in GenBank under Accession No. U89790 and these sequences are hereby incorporated 
in their entireties by this reference. Examples of AAV5 sequences that can be utilized in these 
constructs can be found in GenBank under Accession No. AF085716 and these sequences are 
hereby incorporated in their entireties by this reference. Examples of AAV6 sequences that 
can be utilized in these constructs can be found in GenBank under Accession No. NCJ301862 
and AF028704 and these sequences are hereby incorporated in their entireties by this reference. 
Examples of AAV7 sequences that can be utilized in these constructs can be found in GenBank 
under Accession No. AF5 13851 and these sequences are hereby incorporated in their entireties 
by this reference. Examples of AAV8 sequences that can be utilized in these constructs can be 
found in GenBank under Accession No. AF5 13852 and these sequences are hereby 
incorporated in their entireties by this reference. Examples of AAV9 sequences that can be 
utilized in these constructs can be found in GenBank under Accession No. AY530579 and 
these sequences are hereby incorporated in their entireties by this reference. Examples of 
AAV1 0 sequences that can be utilized in these constructs can be found in GenBank under 
Accession No. AY63 1965 and these sequences are hereby incorporated in their entireties by 
this reference. Examples of AAV1 1 sequences that can be utilized in these constructs can be 
found in GenBank under Accession No. AY631966 and these sequences are hereby 
incorporated in their entireties by this reference. Examples of AAAV sequences that can be 
utilized in these constructs can be found in GenBank under Accession No. AY186198 and 
these sequences are hereby incorporated in their entireties by this reference. Examples of 
BAAV sequences that can be utilized in these constructs can be found in GenBank Accession 
No. AY388617 and these sequences are hereby incorporated in their entireties by this 
reference. 

In any of the constructs described herein, inclusion of a promoter is preferred. As used 
in the constructs herein, unless otherwise specified, Cap (capsid) refers to any of VP1, VP2, 
VP3, combinations thereof, functional fragments of any of VP1, VP2 or VP3, or chimeric 
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capsids as described herein. The ITRs of the constructs described herein, can be chimeric 
recombinant ITRs as described elsewhere in the application. 

Conjugates of recombinant or wild-type AAVX virions and nucleic acids or proteins 
can be used to deliver those molecules to a cell. For example, the purified AAVX can be used 
as a vehicle for delivering DNA bound to the exterior of the virus. Examples of this are to 
conjugate the DNA to the virion by a bridge using poly-L-lysine or other charged molecule. 
Also contemplated are virosomes that contain AAVX structural proteins (AAVX capsid 
proteins), lipids such as DOTAP, and nucleic acids that are complexed via charge interaction to 
introduce DNA into cells. 

Also provided herein are AAVX capsid proteins (e.g. VP1, VP2 or VP3 or 
combinations thereof), or AAVX particles consisting of AAVX capsid proteins, wherein the 
capsid proteins or particles do not contain AAV nucleic acid, vector or plasmid, and are 
therefore not infectious. These capsid proteins and "empty particles" can comprise other 
substances such as biologically active molecules (e.g., small molecules, polypeptides, or non- 
AAV nucleic acids). The substances can be conjugated to the capsid proteins or comprise a 
fusion protein with an AAVX capsid polypeptide. Alternatively, the substance can be 
incorporated within an AAVX empty particle. AAVX capsid proteins and empty particles can 
be used to deliver the substance to a target cell using the targeting ability of the capsid protein 
to achieve the desired tissue tropism. In addition, the empty particles can function to protect 
the substance from degredation or immune response. 

Also provided herein are conjugates that utilize the AAVX capsid or a unique region of 
the AAVX capsid protein (e.g. VP1, VP2 or VP3 or combinations thereof) to introduce DNA 
into cells. For example, the AAVX VP1 protein or fragment thereof, can be conjugated to a 
DNA on a plasmid, that is conjugated to a lipid. Cells can be infected using the targeting 
ability of the VP! capsid protein to achieve the desired tissue tropism, specific to the AAVX. 
AAVX VP2 and VP3 proteins can also be utilized to introduce DNA or other molecules into 
cells. By further incorporating the Rep protein and the AAV TRS into the DNA-containing 
conjugate, cells can be transduced and targeted integration can be achieved. For example, if 
AAVX specific targeted integration is desired, a conjugate composed of the AAVX VP1 
capsid, AAVX Rep or a fragment of AAVX Rep, AAVX TRS, the Rep binding site, the 
exogenous DNA of interest, and a lipid, can be utilized to achieve AAVX specific tropism and 
AAVX specific targeted integration in the genome. 
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Further provided herein are chimeric viruses where AAVX vectors can be encapsidated 
by herpes simplex virus (HSV) (Heister, T., et al. J Virol. 2002 Jul;76(14):7 163-73), 
incorporated herein for its teaching of HSV/ AAV hybrid vectors), baculovirus or other viruses 
to achieve a desired tropism associated with another virus. For example, the AAVX ITRs 
could be encapsidated by HSV and cells could be infected. Post-infection, the ITRs of AAVX 
could be acted on by AAVX Rep provided in the system or in a separate vehicle to rescue 
AAVX from the genome. Therefore, the cellular tropism of HSV can be combined with 
AAVX Rep mediated targeted integration. Other viruses that could be utilized to construct 
chimeric viruses include lenti virus, retrovirus, pseudotyped retroviral vectors and adenoviral 
vectors. 

Provided herein are isolated nucleic acids of AAV-X1, AAV-Xlb, AAV-X5, AAV- 
X19, AAV-X21, AAV-X22, AAV-X23, AAV-X24, AAV-X25, and AAV-X26. For example, 
provided is an isolated nucleic acid comprising the nucleotide sequence set forth in SEQ ID 
NO:l (AAV-X1 partial genome). Also provided is an isolated nucleic acid comprising the 
nucleotide sequence set forth in SEQ ID NO:2 (AAV-Xlb partial genome). Also provided is 
an isolated nucleic acid comprising the nucleotide sequence set forth in SEQ ID NO:3 (AAV- 
X5 partial genome). Also provided is an isolated nucleic acid comprising the nucleotide 
sequence set forth in SEQ ID NO:4 (AAV-X19 partial genome). Also provided is an isolated 
nucleic acid comprising the nucleotide sequence set forth in SEQ ED NO:5 (AAV-X21 partial 
genome). Also provided is an isolated nucleic acid comprising the nucleotide sequence set 
forth in SEQ ID NO:6 (AAV-X22 partial genome). Also provided is an isolated nucleic acid 
comprising the nucleotide sequence set forth in SEQ ID NO:7 (AAV-X23 partial genome). 
Also provided is an isolated nucleic acid comprising the nucleotide sequence set forth in SEQ 
ID NO:8 (AAV-X24 partial genome). Also provided is an isolated nucleic acid comprising the 
nucleotide sequence set forth in SEQ ID NO:9 (AAV-X25 partial genome). Also provided is 
an isolated nucleic acid comprising the nucleotide sequence set forth in SEQ ID NO: 10 (AAV- 
X26 partial genome). 

This nucleic acid, or unique portions thereof, can be inserted into vectors, such as 
plasmids, yeast artificial chromosomes, or other viral vector (particle), if desired, by standard 
cloning methods. Also provided is an isolated nucleic acid consisting essentially of the 
nucleotide sequence set forth in SEQ ID NOs:l-10. 

The phrase "consisting essentially of is used herein to refer to a composition that 
comprises the essential characteristics of the identified composition. By "essential" is meant 
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the characteristics that contribute to the structure or function of the disclosed molecule. Thus, 
any substitution, deletion or addition to the provided composition that does not significantly 
alter the defining characteristics of the composition are considered therein. 

For example, if an amino acid sequence X is disclosed, then a provided polypeptide 
consisting essentially of the amino acid sequence X includes, for example, conservative amino 
acid substitutions (as described below) that do not significantly alter the essential 
characteristics of the polypeptide, e.g., secondary/ tertiary structure or function of the protein. 
The provided polypeptide can further constitute a fusion protein or otherwise have additional 
N-terminal, C-terminal, or intermediate amino acid sequences, e.g., linkers or tags. "Linker", 
as used herein, is an amino acid sequences or insertion that can be used to connect or separate 
two distinct polypeptides or polypeptide fragments, wherein the linker does not otherwise 
contribute to the essential function of the composition. A polypeptide provided herein, can 
have an amino acid linker comprising, for example, the amino acids GLS, ALS, or LLA. A 
"tag", as used herein, refers to a distinct amino acid sequence that can be used to detect or 
purify the provided polypeptide, wherein the tag does not otherwise contribute to the essential 
function of the composition. The provided polypeptide can further have deleted N-terminal, C- 
terminal or intermediate amino acids that do not contribute to the essential activity of the 
polypeptide. 

As another example, if a nucleic acid X is disclosed, then a provided nucleic acid 
consisting essentially of nucleic acid sequence X, includes, for example, nucleotide 
substitutions that do not alter the amino acid sequence of the encoded polypeptide, i.e., due to 
degeneracy. If sequence X comprises introns and exons, then the provided nucleic acid can 
further be the cDNA sequence that lacks the introns but comprises the exons of sequence X. To 
the extent that specific genes within a genome are identified herein, it is further understood that 
the disclosure of a nucleic acid consisting essentially of the genome sequence would include 
fragments of the genome such as isolated sequences comprising a gene or genes within the 
genome. 

Other characteristics of nucleic acid or amino acid sequences that are not herein 
considered essential include, for example, junk DNA between genes or any identifiable 
sequence unit, e.g., promoters, enhancers, transmembrane domains, poly-adenylation 
sequences, signal sequences, etc., that when substituted or removed would be presumed by one 
skilled in the art to not significantly alter the essential characteristics of the disclosed sequence. 
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Thus, the nucleotides of SEQ ID NOS:1-10 can have minor modifications and still be 
contemplated herein. For example, modifications that do not alter the amino acid encoded by 
any given codon (such as by modification of the third, "wobble," position in a codon) can 
readily be made, and such alterations are known in the art. Furthermore, modifications that 
cause a resulting neutral (conserved) amino acid substitution of a similar amino acid can be 
made in a coding region of the genome. Additionally, modifications as described herein for 
the AAVX components, such as the ITRs, the p5 promoter, etc. are contemplated herein. 
Furthermore, modifications to regions of SEQ ID NOS:1-10, other than in the ITR, TRS, Rep 
binding site and hairpin, are likely to be tolerated without serious impact on the function of the 
nucleic acid as a recombinant vector. 

As used herein, the term "isolated" refers to a nucleic acid separated or significantly 
free from at least some of the other components of the naturally occurring organism, for 
example, the cell structural components or viral components commonly found associated with 
nucleic acids in the environment of the virus and/or other nucleic acids. The isolation of the 
native nucleic acids can be accomplished, for example, by techniques such as cell lysis 
followed by phenol plus chloroform extraction, followed by ethanol precipitation of the nucleic 
acids. The nucleic acids provided herein can be isolated from cells according to any of many 
methods well known in the art. 

As used herein, the term "nucleic acid" refers to single- or multiple-stranded molecules 
which may be DNA or RNA, or any combination thereof, including modifications to those 
nucleic acids. The nucleic acid may represent a coding strand or its complement, or any 
combination thereof. Nucleic acids may be identical in sequence to the sequences which are 
naturally occurring for any of the genes discussed herein or may include alternative codons 
which encode the same amino acid as those provided herein, including that which is found in 
the naturally occurring sequence. These nucleic acids can also be modified from their typical 
structure. Such modifications include, but are not limited to, methylated nucleic acids, the 
substitution of a non-bridging oxygen on the phosphate residue with either a sulfur (yielding 
phosphorothioate deoxynucleotides), selenium (yielding phosphorselenoate deoxynucleotides), 
or methyl groups (yielding methylphosphonate deoxynucleotides). 

Additionally provided is an isolated nucleic acid that selectively hybridizes with any 
nucleic acid disclosed herein, including the AAV-X1, AAV-Xlb, AAV-X5, AAV-X19, AAV- 
X21, AAV-X22, AAV-X23, AAV-X24, AAV-X25, or AAV-X26 genome (SEQ ID NOSrl- 
1 0) and any unique fragment thereof, including the Rep and capsid encoding sequences, 
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promoters and ITRs. Specifically, the nucleic acid can selectively or specifically hybridize to 
an isolated nucleic acid consisting of the nucleotide sequence set forth in SEQ ID NO:l, 2, 3, 
4, 5, 6, 7, 8, 9, or 1 0. By "selectively hybridizes" as used herein is meant a nucleic acid that 
hybridizes to one of the disclosed nucleic acids under sufficient stringency conditions without 
significant hybridization to a nucleic acid encoding an unrelated protein, and particularly, 
without detectably hybridizing to nucleic acids of other AAVs. Thus, a nucleic acid that 
selectively hybridizes with a nucleic acid provided herein will not selectively hybridize under 
stringent conditions with a nucleic acid encoding a different protein or the corresponding 
protein from a different serotype of the virus, and vice versa. A "specifically hybridizing" 
nucleic acid is one that hybridizes under stringent conditions to only a nucleic acid found in 
AAVX. Therefore, nucleic acids for use, for example, as primers and probes to detect or 
amplify the target nucleic acids are contemplated herein. Nucleic acid fragments that 
selectively hybridize to any given nucleic acid can be used, e.g., as primers and or probes for 
further hybridization or for amplification methods (e.g., polymerase chain reaction (PCR), 
ligase chain reaction (LCR)). Additionally, for example, a primer or probe can be designed 
that selectively hybridizes with both AAVX and a gene of interest carried within the AAVX 
vector (i.e., a chimeric nucleic acid). 

Stringency of hybridization is controlled by both temperature and salt concentration of 
either or both of the hybridization and washing steps. Typically, the stringency of 
hybridization to achieve selective hybridization involves hybridization in high ionic strength 
solution (6X SSC or 6X SSPE) at a temperature that is about 12-25°C below the T m (the 
melting temperature at which half of the molecules dissociate from their hybridization 
partners) followed by washing at a combination of temperature and salt concentration chosen 
so that the washing temperature is about 5°C to 20°C below the T m . The temperature and salt 
conditions are readily determined empirically in preliminary experiments in which samples of 
reference DNA immobilized on filters are hybridized to a labeled nucleic acid of interest and 
then washed under conditions of different stringencies. Hybridization temperatures are 
typically higher for DNA-RNA and RNA-RNA hybridizations. The washing temperatures can 
be used as described above to achieve selective stringency, as is known in the art. (Sambrook 
et al., Molecular Cloning: A Laboratory Manual, 2nd Ed., Cold Spring Harbor Laboratory, 
Cold Spring Harbor, New York, 1989; Kunkel et al. Methods Enzymol 1987:154:367, 1987). 
A preferable stringent hybridization condition for a DNA:DNA hybridization can be at about 
68°C (in aqueous solution) in 6X SSC or 6X SSPE followed by washing at 68°C. Stringency 
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of hybridization and washing, if desired, can be reduced accordingly as the degree of 
complementarity desired is decreased, and further, depending upon the G-C or A-T richness of 
any area wherein variability is searched for. Likewise, stringency of hybridization and 
washing, if desired, can be increased accordingly as homology desired is increased, and 
further, depending upon the G-C or A-T richness of any area wherein high homology is 
desired, all as known in the art. 

A nucleic acid that selectively hybridizes to any portion of the AAVX genome is 
contemplated herein. Therefore, a nucleic acid that selectively hybridizes to AAVX can be of 
longer length than the AAVX genome, it can be about the same length as the AAVX genome 
or it can be shorter than the AAVX genome. The length of the nucleic acid is limited on the 
shorter end of the size range only by its specificity for hybridization to AAVX, i.e., once it is 
too short, typically less than about 5 to 7 nucleotides in length, it will no longer bind 
specifically to AAVX, but rather will hybridize to numerous background nucleic acids. 
Additionally contemplated herein is a nucleic acid that has a portion that specifically 
hybridizes to AAVX and a portion that specifically hybridizes to a gene of interest inserted 
within AAVX. 

Provided is an isolated nucleic acid comprising an AAVX p5 promoter. Provided is an 
isolated nucleic acid comprising an AAVX pi 9 promoter. Provided is an isolated nucleic acid 
comprising an AAVX p40 promoter. Provided is an isolated nucleic acid comprising an AAVX 
ITR. Further provided is an isolated nucleic acid encoding an AAVX Rep protein. The AAVX 
Rep proteins are encoded by open reading frame (ORF) 1 of the AAVX genome. Examples of 
the AAV Rep proteins include Rep78, Rep68, Rep52 and Rep40. However, it is contemplated 
that the Rep nucleic acid can encode any one, two, three, or four of the four Rep proteins, in 
any order. 

Furthermore, minor modifications are contemplated in the nucleic acid, such as silent 
mutations in the coding sequences, mutations that make neutral or conservative changes in the 
encoded amino acid sequence, and mutations in regulatory regions that do not disrupt the 
expression of the gene. Examples of other minor modifications are known in the art. Further 
modifications can be made in the nucleic acid, such as to disrupt or alter expression of one or 
more of the Rep proteins in order to, for example, determine the effect of such a disruption; 
such as to mutate one or more of the Rep proteins to determine the resulting effect, etc. 

Further provided is an isolated nucleic acid encoding an AAVX Capsid protein. 
Furthermore, provided is a nucleic acid encoding each of the three AAVX capsid proteins, 
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VP1, VP2 5 and VP3. Thus, provided is an isolated nucleic acid encoding AAVX VP1, a 
nucleic acid encoding AAVX VP2, and an isolated nucleic acid encoding AAVX VP3. Thus, 
provided is an isolated nucleic acid encoding the amino acid sequence set forth in SEQ ID 
NO:21 (AAV-X1 VP1). Thus, provided is an isolated nucleic acid encoding the amino acid 
sequence set forth in SEQ ID NO:22 (AAV-Xlb VP1). Thus, provided is an isolated nucleic 
acid encoding the amino acid sequence set forth in SEQ ID NO:23 (AAV-X5 VP1). Thus, 
provided is an isolated nucleic acid encoding the amino acid sequence set forth in SEQ ID 
NO:24 (AAV-X19 VP1). Thus, provided is an isolated nucleic acid encoding the amino acid 
sequence set forth in SEQ ID NO:25 ( AAV-X2 1 VP1). Thus, provided is an isolated nucleic 
acid encoding the amino acid sequence set forth in SEQ ID NO:26 (AAV-X22 VP1). Thus, 
provided is an isolated nucleic acid encoding the amino acid sequence set forth in SEQ ID 
NO:27 (AAV-X23 VP1). Thus, provided is an isolated nucleic acid encoding the amino acid 
sequence set forth in SEQ ID NO:28 (AAV-X24 VP1). Thus, provided is an isolated nucleic 
acid encoding the amino acid sequence set forth in SEQ ID NO:29 (AAV-X25 VP1). Thus, 
provided is an isolated nucleic acid encoding the amino acid sequence set forth in SEQ ID 
NO:30 (AAV-X26 VP1). 

Also specifically provided is an isolated nucleic acid comprising SEQ ID NO:l 1 
(AAV-X1 VP1). Also specifically provided is an isolated nucleic acid comprising SEQ ID 
NO: 12 (AAV-Xlb VP1). Also specifically provided is an isolated nucleic acid comprising 
SEQ ID NO:13 (AAV-X5 VP1). Also specifically provided is an isolated nucleic acid 
comprising SEQ ID NO:14 (AAV-X19 VP1). Also specifically provided is an isolated nucleic 
acid comprising SEQ ID NO:15 (AAV-X21 VP1). Also specifically provided is an isolated 
nucleic acid comprising SEQ ID NO:16 (AAV-X22 VP1). Also specifically provided is an 
isolated nucleic acid comprising SEQ ID NO: 1 7 (AAV-X23 VP1). Also specifically provided 
is an isolated nucleic acid comprising SEQ ID NO: 18 (AAV-X24 VP1). Also specifically 
provided is an isolated nucleic acid comprising SEQ ID NO: 19 (AAV-X25 VP1). Also 
specifically provided is an isolated nucleic acid comprising SEQ ID NO:20 (AAV-X26 VP1). 

Minor modifications in the nucleotide sequences encoding the capsid, or coat, proteins 
are contemplated, as described above for other AAVX nucleic acids. However, in general, a 
modified nucleic acid encoding a capsid protein will have at least about 80%, 81%, 82%, 83%, 
84%, 85%, 86%, 87%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 98.5%, 99%, 
99.5% or 99.9% or 100% homology to the capsid nucleic sequences described herein e.g., 
SEQ ID NOS:l 1-20, and the capsid polypeptide encoded therein will have overall about 80%, 
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81%, 82%, 83%, 84%, 85%, 86%, 87%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 
98%, 98.5%, 99%, 99.5% or 99.9% or 100% homology with the amino acid sequence 
described herein, e.g., SEQ ID NOS:21-30. Isolated nucleic acids that selectively hybridize 
with the nucleic acids of SEQ ID NOS:l 1-20 under the conditions described above are also 
provided. 

Also provided is a cell containing one or more of the herein described nucleic acids, 
such as the AAVX genome, AAVX ORF1 and ORF2, each AAVX Rep protein gene, or each 
AAVX capsid protein gene. Such a cell can be any desired cell and can be selected based upon 
the use intended. For example, cells can include bacterial cells, yeast cells, insect cells, human 
HeLa cells and simian Cos cells as well as other human and mammalian cells and cell lines. 
Primary cultures as well as established cultures and cell lines can be used. Nucleic acids 
provided herein can be delivered into cells by any selected means, in particular depending 
upon the target cells. Many delivery means are well-known in the art. For example, 
electroporation, calcium phosphate precipitation, microinjection, cationic or anionic liposomes, 
and liposomes in combination with a nuclear localization signal peptide for delivery to the 
nucleus can be utilized, as is known in the art. Additionally, if the nucleic acids are in a viral 
particle, the cells can simply be transduced with the virion by standard means known in the art 
for AAV transduction. Small amounts of the recombinant AAVX virus can be made to infect 
cells and produce more of itself. 

Provided herein are purified AAVX polypeptides. The term "polypeptide" as used 
herein refers to a polymer of amino acids and includes full-length proteins and fragments 
thereof. Thus, "protein," polypeptide," and "peptide" are often used interchangeably herein. 
Substitutions can be selected by known parameters to be neutral {see, e.g., Robinson WE Jr, 
and Mitchell WM., AIDS 4:S151-S162 (1990)). As will be appreciated by those skilled in the 
art, also provided herein are those polypeptides having slight variations in amino acid 
sequences or other properties. Such variations may arise naturally as allelic variations (e.g., 
due to genetic polymorphism) or may be produced by human intervention {e.g., by 
mutagenesis of cloned DNA sequences), such as induced point, deletion, insertion and 
substitution mutants. Minor changes in amino acid sequence are generally preferred, such as 
conservative amino acid replacements, small internal deletions or insertions, and additions or 
deletions at the ends of the molecules. Substitutions may be designed based on, for example, 
the model of Dayhoff, et al (in Atlas of Protein Sequence and Structure 1918, Natl Biomed. 
Res. Found., Washington, D.C.). These modifications can result in changes in the amino acid 
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sequence, provide silent mutations, modify a restriction site, or provide other specific 
mutations. The location of any modifications to the polypeptide will often determine its 
impact on function. Particularly, alterations in regions non-essential to protein function will be 
tolerated with fewer effects on function. Elsewhere in the application regions of the AAVX 
proteins are described to provide guidance as to where substitutions, additions or deletions can 
be made to minimize the likelihood of disturbing the function of the variant. 

Protein variants and derivatives are well understood to those of skill in the art and in 
can involve amino acid sequence modifications. For example, amino acid sequence . 
modifications typically fall into one or more of three classes: substitutional, insertional or 
deletional variants. Insertions include amino and/or carboxyl terminal fusions as well as 
intrasequence insertions of single or multiple amino acid residues. Insertions ordinarily will be 
smaller insertions than those of amiijo or carboxyl terminal fusions, for example, on the order 
of one to four residues. Deletions are characterized by the removal of one or more amino acid 
residues from the protein sequence. Typically, no more than about from 2 to 6 residues are 
deleted at any one site within the protein molecule. These variants ordinarily are prepared by 
site specific mutagenesis of nucleotides in the DNA encoding the protein, thereby producing 
DNA encoding the variant, and thereafter expressing the DNA in recombinant cell culture. 
Techniques for making substitution mutations at predetermined sites in DNA having a known 
sequence are well known, for example Ml 3 primer mutagenesis and PCR mutagenesis. Amino 
acid substitutions are typically of single residues, but can occur at a number of different 
locations at once; insertions usually will be on the order of about from 1 to 10 amino acid 
residues; and deletions will range about from 1 to 30 residues. Deletions or insertions 
preferably are made in adjacent pairs, i.e. a deletion of 2 residues or insertion of 2 residues. 
Substitutions, deletions, insertions or any combination thereof may be combined to arrive at a 
final construct. The mutations must not place the sequence out of reading frame and preferably 
will not create complementary regions that could produce secondary mRNA structure. 
Substitutional variants are those in which at least one residue has been removed and a different 
residue inserted in its place. Such substitutions generally are made in accordance with the 
following Tables 1 and 2 and are referred to as conservative substitutions. 



TABLE 1: Amino Acid Abbreviations 


Amino Acid 


Abbreviations 


alanine 


Ala 


A 


allosoleucine 


AUe 




arginine 


Arg 


R 


asparagine 


Asn 


N 
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TABLE 1: Amino Acid Abbreviations 


Amino Acid 


Abbreviations 


aspartic acid 


Asp 


D 


cysteine 


Cys 


C 


glutamic acid 


Glu 


E 


giutamine 


Gin 


Q 


glycine 


Gly 


G 


histidine 


His 


H 


isolelucine 


lie 


I 


leucine 


Leu 


L 


lysine 


Lys 


K 


phenylalanine 


Phe 


F 


proline 


Pro 


P 


pyroglutamic acid 


pGlu 




serine 


Ser 


S 


threonine 


Thr 


T 


tyrosine 


Tyr 


Y 


tryptophan 


Trp 


W 


valine 


Val 


V 



TABLE 2: Amino Acid Substitutions 


Original Residue Exemplary Conservative 
Substitutions, others are known in the art. 


Ala 


Ser 


Arg 


Lys; Gin 


Asn 


Gin; His 


Asp 


Glu 


Cys 


Ser 


Gin 


Asn, Lys 


Glu 


Asp 


Gly 


Pro 


His 


Asn;Gln 


lie 


Leu; Val 


Leu 


Be; Val 


Lys 


Arg; Gin 


Met 


Leu; Ee 


Phe 


Met; Leu; Tyr 


Ser 


Thr 


Thr 


Ser 


Trp 


Tyr 


Tyr 


Trp; Phe 


Val 


He; Leu 



Substantial changes in function or immunological identity can result from selecting 
substitutions that are less conservative than those in Table 2, i.e., selecting residues that differ 
more significantly in their effect on maintaining (a) the structure of the polypeptide backbone 
in the area of the substitution, for example as a sheet or helical conformation, (b) the charge or 
hydrophobicity of the molecule at the target site or (c) the bulk of the side chain. The 
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substitutions which in general are expected to produce the greatest changes in the protein 
properties will be those in which (a) a hydrophilic residue, e.g. seryl or threonyl, is substituted 
for (or by) a hydrophobic residue, e.g. leucyl, isoleucyl, phenylalanyl, valyl or alanyl; (b) a 
cysteine or proline is substituted for (or by) any other residue; (c) a residue having an 
electropositive side chain, e.g., lysyl, arginyl, or histidyl, is substituted for (or by) an 
electronegative residue, e.g., glutamyl or aspartyl; or (d) a residue having a bulky side chain, 
e.g., phenylalanine, is substituted for (or by) one not having a side chain, e.g., glycine, in this 
case, (e) by increasing the number of sites for sulfation and/or glycosylation. 

For example, the replacement of one amino acid residue with another that is 
biologically and/or chemically similar is known to those skilled in the art as a conservative 
substitution. For example, a conservative substitution would be replacing one hydrophobic 
residue for another, or one polar residue for another. The substitutions include combinations 
such as, for example, Gly, Ala; Val, lie, Leu; Asp, Glu; Asn, Gin; Ser, Thr; Lys, Arg; and Phe, 
Tyr. Such conservatively substituted variations of each explicitly disclosed sequence are 
included within the mosaic polypeptides provided herein. 

Generally, a conservative substitution is a substitution of an amino acid residue for 
another amino acid residue having similar biochemical properties. Typically, conservative 
substitutions have little to no impact on the biological activity of a resulting polypeptide. In a 
particular example, a conservative substitution is an amino acid substitution in a peptide that 
does not substantially affect the biological function of the peptide. A peptide can include one 
or more amino acid substitutions, for example 2-10 conservative substitutions, 2-5 
conservative substitutions, 4-9 conservative substitutions, such as 2, 5 or 10 conservative 
substitutions. 

For example, a conservative substitution in an AAVX VP1 peptide (such as a peptides 
encoded by SEQ ID NOS:21-30) does not substantially affect the ability of VP1 peptide to 
confer the unique tropism of the AAVX particle. A polypeptide can be produced to contain one 
or more conservative substitutions by manipulating the nucleotide sequence that encodes that 
polypeptide using, for example, standard procedures such as site-directed mutagenesis or PCR. 
Alternatively, a polypeptide can be produced to contain one or more conservative substitutions 
by using standard peptide synthesis methods. An alanine scan can be used to identify which 
amino acid residues in a protein can tolerate an amino acid substitution. In one example, the 
biological activity of the protein is not decreased by more than 25%, for example not more 
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than 20%, for example not more than 10%, when an alanine, or other conservative amino acid 

(such as those listed below), is substituted for one or more native amino acids. 

Examples of amino acids which can be substituted for an original amino acid in a 
protein and which are regarded as conservative substitutions include, but are not limited to: Ser 
for Ala; Lys for Arg; Gin or His for Asn; Glu for Asp; Ser for Cys; Asn for Gin; Asp for Glu; 
Pro for Gly; Asn or Gin for His; Leu or Val for He; He or Val for Leu; Arg or Gin for Lys; Leu 
or lie for Met; Met, Leu or Tyr for Phe; Thr for Ser; Ser for Thr; Tyr for Tip; Trp or Phe for 
Tyr; and lie or Leu for Val. 

Further information about conservative substitutions can be found in, among other 
locations in, Ben-Bassat et al., (J. Bacterid.- 169:751-7, 1987), O'Regan et al., (Gene 77:237- 
51, 1989), Sahin-Toth et al., (Protein Sci. 3:240-7, 1994), Hochuli et al., (Bio/Technology 
6:1321-5, 1988) and in standard textbooks of genetics and molecular biology. 

Substitutional or deletional mutagenesis can be employed to insert sites for N- 
glycosylation (Asn-X-Thr/Ser) or O-glycosylation (Ser or Thr). Deletions of cysteine or other 
labile residues also may be desirable. Deletions or substitutions of potential proteolysis sites, 
e.g. Arg, is accomplished for example by deleting one of the basic residues or substituting one 
by glutaminyl or histidyl residues. 

Certain post-translational derivatizations are the result of the action of recombinant host 
cells on the expressed polypeptide. Glutaminyl and asparaginyl residues are frequently post- 
translationally deamidated to the corresponding glutamyl and asparyl residues. Alternatively, 
these residues are deamidated under mildly acidic conditions. Other post-translational 
modifications include hydroxylation of proline and lysine, phosphorylation of hydroxyl groups 
of seryl or threonyl residues, methylation of the o-amino groups of lysine, arginine, and 
histidine side chains (T.E. Creighton, Proteins: Structure and Molecular Properties, W. H. 
Freeman & Co., San Francisco pp 79-86 [1983]), acetylation of the N-terminal amine and, in 
some instances, amidation of the C-terminal carboxyl. 

It is understood that there are numerous amino acid and peptide analogs which can be 
incorporated into the disclosed compositions. For example, there are numerous D amino acids 
or amino acids which have a different functional substituent then the amino acids shown in 
Table 1 and Table 2. The opposite stereo isomers of naturally occurring peptides are disclosed, 
as well as the stereo isomers of peptide analogs. These amino acids can readily be 
incorporated into polypeptide chains by charging tRNA molecules with the amino acid of 
choice and engineering genetic constructs that utilize, for example, amber codons, to insert the 
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analog amino acid into a peptide chain in a site specific way (Thorson et al., Methods in 
Molec. Biol. 77:43-73 (1991), Zoller, Current Opinion in Biotechnology, 3:348-354 (1992); 
Ibba, Biotechnology & Genetic Enginerring Reviews 13:197-216 (1995), Cahill et aL, TIBS, 
14(10):400-403 (1989); Benner, TIB Tech, 12:158-163 (1994); Ibba and Hennecke, 
Bio/technology, 12:678-682 (1994) all of which are herein incorporated by reference at least 
for material related to amino acid analogs). 

Molecules can be produced that resemble peptides, but which are not connected via a 
natural peptide linkage. For example, linkages for amino acids or amino acid analogs can 
include CH2NH-, -CH2S-, --CH2--CH2 -, ~CH=CH~ (cis and trans), -COCH2 ~, - 
CH(OH)CH2--, and -CHH2SO— (These and others can be found in Spatola, A. F. in 
Chemistry and Biochemistry of Amino Acids, Peptides, and Proteins, B. Weinstein, eds., 
Marcel Dekker, New York, p. 267 (1983); Spatola, A. F., Vega Data (March 1983), Vol. 1, 
Issue 3, Peptide Backbone Modifications (general review); Morley, Trends Pharm Sci (1980) 
pp. 463-468; Hudson, D. etaL, Int J Pept Prot Res 14:177-185 (1979) (-CH2NH-, CH2CH2-- 
); Spatola et al. Life Sci 38:1243-1249 (1986) (~CH H2--S); Hann J. Chem. Soc Perkin Trans. 
I 307-314 (1982) (-CH-CH-, cis and trans); Almquist et al. J. Med. Chem. 23:1392-1398 
(1980) (-COCH2-); Jennings-White et aL Tetrahedron Lett 23:2533 (1982) (-COCH2-); 
Szelke et al. European Appln, EP 45665 CA (1982): 97:39405 (1982) (~CH(OH)CH2~); 
Holladay et al. Tetrahedron. Lett 24:4401-4404 (1983) (~C(OH)CH2--); and Hruby Life Sci 
31 .T89-199 (1982) (--CH2--S-); each of which is incorporated herein by reference. A 
particularly preferred non-peptide linkage is --CH2NH-. It is understood that peptide analogs 
can have more than one atom between the bond atoms, such as b-alanine, g-aminobutyric acid, 
and the like. 

Amino acid analogs and analogs and peptide analogs often have enhanced or desirable 
properties, such as, more economical production, greater chemical stability, enhanced 
pharmacological properties (half-life, absorption, potency, efficacy, etc.), altered specificity 
(e.g., a broad-spectrum of biological activities), reduced antigenicity, and others. 

D-amino acids can be used to generate more stable peptides, because D amino acids are 
not recognized by peptidases and such. Systematic substitution of one or more amino acids of 
a consensus sequence with a D-amino acid of the same type (e.g., D-lysine in place of L- 
lysine) can be used to generate more stable peptides. Cysteine residues can be used to cyclize 
or attach two or more peptides together. This can be beneficial to constrain peptides into 
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particular conformations. (Rizo and Gierasch Ann. Rev. Biochem. 61 :387 (1992), 
incorporated herein by reference). 

A polypeptide provided herein can be readily obtained by any of several means. For 
example, the polypeptide of interest can be synthesized chemically by standard methods. 
Additionally, the coding regions of the genes can be recombinantly expressed and the resulting 
polypeptide isolated by standard methods. Furthermore, an antibody specific for the resulting 
polypeptide can be raised by standard methods (see, e.g., Harlow and Lane, Antibodies: A 
Laboratory Manual, Cold Spring Harbor Laboratory, Cold Spring Harbor, New York, 1 988), 
and the protein can be isolated from a cell expressing the nucleic acid encoding the polypeptide 
by selective hybridization with the antibody. This protein can be purified to the extent desired 
by standard methods of protein purification (see, e.g., Sambrook et al., Molecular Cloning: A 
Laboratory Manual, 2nd Ed., Cold Spring Harbor Laboratory, Cold Spring Harbor, New York, 
1989). 

An antigenic or immunoreactive fragment of the provided compositions and methods is 
typically an amino acid sequence of at least about 5 consecutive amino acids, and it can be 
derived from the AAVX polypeptide amino acid sequence. An antigenic AAVX fragment is 
any fragment unique to the AAVX protein, as described herein, against which an AAVX- 
specific antibody can be raised, by standard methods. Thus, the resulting antibody-antigen 
reaction should be specific for AAVX. 

By "unique fragment thereof is meant any smaller polypeptide fragment encoded by 
an AAVX rep gene that is of sufficient length to be found only in the Rep polypeptide. 
Substitutions and modifications of the amino acid sequence can be made as described herein 
and, further, can include protein processing modifications, such as glycosylation, to the 
polypeptide. Typically, to be unique, a polypeptide fragment provided herein will be at least 
about 5 amino acids in length; however, unique fragments can be 6, 7, 8, 9, 10, 20, 30, 40, 50, 
60, 70, 80, 90, 100 or more amino acids in length. A unique polypeptide will typically 
comprise such a unique fragment; however, a unique polypeptide can also be determined by its 
overall homology. A unique polypeptide can be 6, 7, 8, 9, 10, 20, 30, 40, 50, 60, 70, 80, 90, 
100 or more amino acids in length. Uniqueness of a polypeptide fragment can readily be 
determined by standard methods such as searches of computer databases of known peptide or 
nucleic acid sequences or by hybridization studies to the nucleic acid encoding the protein or to 
the protein itself, as known in the art. The uniqueness of a polypeptide fragment can also be 
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determined immunologically as well as functionally. Uniqueness can be simply determined in 

an amino acid-by- amino acid comparison of the polypeptides. 

Provided is an isolated AAVX Rep protein. An AAVX Rep polypeptide is encoded by 
ORF1 of AAVX. Also provided is each individual AAVX Rep protein. Provided is an isolated 
polypeptide, comprising AAVX Rep 52, or a unique fragment thereof. Provided is an isolated 
polypeptide, comprising AAV Rep 78, or a unique fragment thereof. 

Further provided is an isolated AAVX Capsid protein or a unique fragment thereof. 
AAVX capsid protein is encoded by ORF 2 of AAVX. Further provided are the individual 
AAVX capsid proteins, VP1 , VP2 and VP3 or unique fragments thereof. Thus, provided is an 
isolated polypeptide having the amino acid sequence set forth in SEQ ID NO:21 (AAV-X1 
VP1). Also provided is an isolated polypeptide having the amino acid sequence set forth in 
SEQ ID NO:22 (AAV-Xlb VP1). Also provided is an isolated polypeptide having the amino 
acid sequence set forth in SEQ ID NO:23 (AAV-X5 VP1). Also provided is an isolated 
polypeptide having the amino acid sequence set forth in SEQ ID NO:24 (AAV-X19 VP1). 
Also provided is an isolated polypeptide having the amino acid sequence set forth in SEQ ID 
NO:25 (AAV-X21 VP1). Also provided is an isolated polypeptide having the amino acid 
sequence set forth in SEQ ID NO:26 (AAV-X22 VP1). Also provided is an isolated 
polypeptide having the amino acid sequence set forth in SEQ ID NO:27 (AAV-X23 VP1). 
Also provided is an isolated polypeptide having the amino acid sequence set forth in SEQ ID 
NO:28 (AAV-X24 VP1). Also provided is an isolated polypeptide having the amino acid 
sequence set forth in SEQ ID NO:29 (AAV-X25 VP1). Also provided is an isolated 
polypeptide having the amino acid sequence set forth in SEQ ID NO:30 (AAV-X26 VP1). 
Further provided is an isolated polypeptide consisting essentially of the amino acid sequence 
set forth in SEQ ID NO:21, 22, 23, 24, 25, 26, 27, 28, 29, or 30. 

By "unique fragment thereof is meant any smaller polypeptide fragment encoded by 
any AAVX capsid gene that is of sufficient length to be found only in the AAVX capsid 
protein. Substitutions and modifications of the amino acid sequence can be made as described 
above and, further, can include protein processing modifications, such as glycosylation, to the 
polypeptide. However, an AAVX Capsid polypeptide including all three coat proteins will 
have greater than about 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 89%, 90%, 91%, 92%, 
93%, 94%, 95%, 96%, 97%, 98%, 98.5%, 99%, 99.5% or 99.9% overall homology to the 
polypeptide encoded by the nucleotides set forth in SEQ ID NOS:21-30. , 
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Also provided herein are isolated AAV-X1 , AAV-Xlb, AAV-X5, AAV-X19, AAV- 
X21, AAV-X22, AAV-X23, AAV-X24, AAV-X25, and AAV-X26 viruses. In one aspect, the 
isolated viruses can be used to produce antibodies specific for each AAVX. Thus, provided is 
an isolated antibody that specifically binds an AAVX-specific protein. The isolated viruses can 
be used to detect antibodies specific for each AAVX. 

Thus, provided is an isolated antibody that specifically binds an AAVX Rep protein, or 
a unique epitope thereof. Thus, also provided is an isolated antibody that specifically bind 
AAVX Rep 52 or AAVX Rep 78, or a unique fragment thereof. Additionally provided is an 
isolated antibody that specifically binds any of the AAVX capsid proteins (VP1, VP2 or VP3), 
a unique epitope thereof, or the polypeptide comprising all three AAVX coat proteins. Also 
provided is an isolated antibody that specifically binds the AAVX capsid protein having the 
amino acid sequence set forth in SEQ ID NO:21 (AAV-X1 VP1), or that specifically binds a 
unique fragment thereof. Also provided is an isolated antibody that specifically binds the 
AAVX capsid protein having the amino acid sequence set forth in SEQ ID NO:22 (AAV-Xlb 
VP1), or that specifically binds a unique fragment thereof. Also provided is an isolated 
antibody that specifically binds the AAVX capsid protein having the amino acid sequence set 
forth in SEQ ID NO:23 (AAV-X5 VP1), or that specifically binds a unique fragment thereof. 
Also provided is an isolated antibody that specifically binds the AAVX capsid protein having 
the amino acid sequence set forth in -SEQ ID NO:24 (AAV-X19 VP1), or that specifically 
binds a unique fragment thereof. Also provided is an isolated antibody that specifically binds 
the AAVX capsid protein having the amino acid sequence set forth in SEQ ID NO:25 (AAV- 
X21 VP1), or that specifically binds a unique fragment thereof. Also provided is an isolated 
antibody that specifically binds the AAVX capsid protein having the amino acid sequence set 
forth in SEQ ID NO:26 (AAV-X22 VP1), or that specifically binds a unique fragment thereof 
Also provided is an isolated antibody that specifically binds the AAVX capsid protein having 
the amino acid sequence set forth in SEQ ID NO:27 (AAV-X23 VP1), or that specifically 
binds a unique fragment thereof Also provided is an isolated antibody that specifically binds 
the AAVX capsid protein having the amino acid sequence set forth in SEQ ID NO:28 (AAV- 
X24 VP1), or that specifically binds a unique fragment thereof Also provided is an isolated 
antibody that specifically binds the AAVX capsid protein having the amino acid sequence set 
forth in SEQ ID NO:29 (AAV-X25 VP1), or that specifically binds a unique fragment thereof 
Also provided is an isolated antibody that specifically binds the AAVX capsid protein having 
the amino acid sequence set forth in SEQ ID NO:30 (AAV-X26 VP1), or that specifically 
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binds a unique fragment thereof. Again, any given antibody can recognize and bind one of a 
number of possible epitopes present in the polypeptide; thus only a unique portion of a 
polypeptide (having the epitope) needs to be present in an assay to determine if the antibody 
specifically binds the polypeptide. 

The antibody can be a component of a composition that comprises an antibody that 
specifically binds the AAVX protein. The composition can further comprise, e.g., serum, 
serum-free medium, or a pharmaceutically acceptable carrier such as physiological saline, etc. 

By "an antibody that specifically binds" an AAVX polypeptide or protein is meant an 
antibody that selectively binds to an epitope on any portion of the AAVX peptide such that the 
antibody binds specifically to the corresponding AAVX polypeptide without significant 
background. Specific binding by an antibody further means that the antibody can be used to 
selectively remove the target polypeptide from a sample comprising the polypeptide or and can 
readily be determined by radioimmunoassay (RIA), bioassay, or enzyme-linked 
immunosorbant (ELISA) technology. An ELISA method effective for the detection of the 
specific antibody-antigen binding can, for example, be as follows: (1) bind the antibody to a 
substrate; (2) contact the bound antibody with a sample containing the antigen; (3) contact the 
above with a secondary antibody bound to a detectable moiety (e.g., horseradish peroxidase 
enzyme or alkaline phosphatase enzyme); (4) contact the above with the substrate for the 
enzyme; (5) contact the above with a color reagent; (6) observe the color change. 

An antibody can include antibody fragments such as Fab fragments which retain the 
binding activity. Antibodies can be made as described in, e.g., Harlow and Lane, Antibodies: 
A Laboratory Manual, Cold Spring Harbor Laboratory, Cold Spring Harbor, New York (1988). 
Briefly, purified antigen can be injected into an animal in an amount and in intervals sufficient 
to elicit an immune response. Antibodies can either be purified directly, or spleen cells can be 
obtained from the animal. The cells are then fused with an immortal cell line and screened for 
antibody secretion. Individual hybridomas are then propagated as individual clones serving as 
a source for a particular monoclonal antibody. 

Additionally provided is a method of screening a cell for infectivity by AAVX, 
comprising contacting the cell with AAVX and detecting the presence of AAVX in the cells. 
AAVX particles can be detected using any standard physical or biochemical methods. For 
example, physical methods that can be used for this detection include DNA based methods 
such as 1) polymerase chain reaction (PCR) for viral DNA or RNA or 2) direct hybridization 
with labeled probes, and immunological methods such as by 3) antibody directed against the 
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viral structural or non- structural proteins. Catalytic methods of viral detection include, but are 
not limited to, detection of site and strand specific DNA nicking activity of Rep proteins or 
replication of an AAV origin- containing substrate. Reporter genes can also be utilized to 
detect cells that transduce AAVX. For example, p-gal, green fluorescent protein or luciferase 
can be inserted into a recombinant AAVX. The cell can then be contacted with the 
recombinant AAVX, either in vitro or in vivo and a colorimetric assay could detect a color 
change in the cells that would indicate transduction of AAVX in the cell. Additional detection 
methods are outlined in Fields, Virology, Raven Press, New York, New York. 1996. 

Provided is a method of screening a cell for infectivity by AAVX, wherein the presence 
of AAVX in the cells is determined by nucleic acid hybridization methods, a nucleic acid 
probe for such detection can comprise, for example, a unique fragment of any of the AAVX 
nucleic acids provided herein. The uniqueness of any nucleic acid probe can readily be 
determined as described herein. Additionally, the presence of AAVX in cells can be 
determined by fluorescence, antibodies to gene products, focus forming assays, plaque lifts, 
Western blots and chromogenic assays. The nucleic acid can be, for example, the nucleic acid 
whose nucleotide sequence is set forth in SEQ ID NOS:l, 2, 3, 4, 5, 6, 7, 8, 9, 10, 1 1, 12, 13, 
14, 15, 16, 17, 18, 19, or 20 or a unique fragment thereof. 

Provided is a method of determining the suitability of an AAVX vector for 
administration to a subject comprising contacting an antibody-containing sample from the 
subject with an antigenic fragment of an isolated AAVX Rep or Capsid protein, and detecting 
an antibody-antigen reaction in the sample, the presence of a neutralizing reaction indicating 
the AAVX vector to be unsuitable for use in the subject. Further provided is a method of 
determining the presence in a subject of an AAVX-specific antibody comprising contacting an 
antibody-containing sample from the subject with an antigenic fragment of an isolated AAVX 
Rep or Capsid protein and detecting an antibody-antigen reaction in the sample, the presence 
of a reaction indicating the presnec of an AAVX-specific antibody in the subject. The present 
methods of determining the suitability of an AAVX vector for administration to a subject or 
the presence of an AAVX-specific antibody in a subject can comprise contacting an antibody- 
containing sample from the subject with a unique antigenic or immunogenic fragment of an 
AAVX Rep protein (e.g. Rep 52, Rep 78) and detecting an antibody-antigen reaction in the 
sample, the presence of a reaction indicating the presence of an AAVX-specific antibody and 
therefore the AAVX vector to be unsuitable for use in the subject . The AAVX Rep proteins 
are provided herein, and their antigenic fragments are routinely determined. The AAVX 
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capsid protein can be used to select an antigenic or immunogenic fragment, for example from 
the amino acid sequence set forth in SEQ ID NO:21 (AAV-X1 VP1), the amino acid sequence 
set forth in SEQ ID NO:22 (AAV-Xlb VP1), the amino acid sequence set forth in SEQ ID 
NO:23 (AAV-X5 VP1), the amino acid sequence set forth in SEQ ID NO:24 (AAV-X 1 9 VP1), 
the amino acid sequence set forth in SEQ ID NO:25 (AAV-X21 VP1), the amino acid 
sequence set forth in SEQ ID NO:26 (AAV-X22 VP1), the amino acid sequence set forth in 
SEQ ID NO:27 (AAV-X23 VP1 , the amino acid sequence set forth in SEQ ID NO:28 (AAV- 
X24 VP1), the amino acid sequence set forth in SEQ ID NO:29 (AAV-X25 VPl),or the amino 
acid sequence set forth in SEQ ED NO:30 (AAV-X26 VP1) 

Alternatively, or additionally, an antigenic or immunogenic fragment of an isolated 
AAVX Rep protein can be utilized in this determination method. Any given antibody can 
recognize and bind one of a number of possible epitopes present in the polypeptide; thus only a 
unique portion of a polypeptide (having the epitope) may need to be present in an assay to 
determine if the antibody specifically binds the polypeptide. 

The AAVX polypeptide fragments can be analyzed to determine their antigenicity, 
immunogenicity and/or specificity. Briefly, various concentrations of a putative 
immunogenically specific fragment are prepared and administered to a subject and the 
immunological response (e.g., the production of antibodies or cell mediated immunity) of an 
animal to each concentration is determined. The amounts of antigen administered depend on 
the subject, e.g. a human, rabbit or a guinea pig, the condition of the subject, the size of the 
subject, etc. Thereafter an animal so inoculated with the antigen can be exposed to the AAVX 
viral particle or AAVX protein to test the immunoreactivity or the antigenicity of the specific 
immunogenic fragment. The specificity of a putative antigenic or immunogenic fragment can 
be ascertained by testing sera, other fluids or lymphocytes from the inoculated animal for cross 
reactivity with other closely related viruses, such as AAV1-1 1, AAAV, or BAAV. 

By the "suitability of an AAVX vector for administration to a subject" is meant a 
determination of whether the AAVX vector will elicit a neutralizing immune response upon 
administration to a particular subject. A vector that does not elicit a significant immune 
response is a potentially suitable vector, whereas a vector that elicits a significant, neutralizing 
immune response (e.g. at least 90%) is thus likely to be unsuitable for use in that subject. 
Significance of any detectable immune response is a standard parameter understood by the 
skilled artisan in the field. For example, one can incubate the subject's serum with the virus, 
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then determine whether that virus retains its ability to transduce cells in culture. If such virus 
cannot transduce cells in culture, the vector likely has elicited a significant immune response. 

Alternatively, or additionally, one skilled in the art could determine whether or not 
AAVX administration would be suitable for a particular cell type of a subject. For example, 
the artisan could culture muscle cells in vitro and transduce the cells with AAVX in the 
presence or absence of the subject's serum. If there is a reduction in transduction efficiency, 
this could indicate the presence of a neutralizing antibody or other factors that may inhibit 
transduction. Normally, greater than 90% inhibition would have to be observed in order to rule 
out the use of AAVX as a vector. However, this limitation could be overcome by treating the 
subject with an immunosuppressant that could block the factors inhibiting transduction. 

As will be recognized by those skilled in the art, numerous types of immunoassays are 
available for use in the present methods to detect binding between an antibody and an AAVX 
polypeptide as provided herein. For instance, direct and indirect binding assays, competitive 
assays, sandwich assays, and the like, as are generally described in, e.g., U.S. Pat Nos. 
4,642,285; 4,376,110; 4,016,043; 3,879,262; 3,852,157; 3,850,752; 3,839,153; 3,791,932; and 
Harlow and Lane, Antibodies, A Laboratory Manual, Cold Spring Harbor Publications, N. Y. 
(1988). For example, enzyme immunoassays such as immunofluorescence assays (IF A), 
enzyme linked immunosorbent assays (ELISA) and immunoblotting can be readily adapted to 
accomplish the detection of the antibody. An ELISA method effective for the detection of the 
antibody bound to the antigen can, for example, be as follows: (1) bind the antigen to a 
substrate; (2) contact the bound antigen with a fluid or tissue sample containing the antibody; 
(3) contact the above with a secondary antibody specific for the antigen and bound to a 
detectable moiety (e.g., horseradish peroxidase enzyme or alkaline phosphatase enzyme); (4) 
contact the above with the substrate for the enzyme; (5) contact the above with a color reagent; 
(6) observe color change. 

The antibody-containing sample of this method can comprise any biological sample 
which would contain the antibody or a cell containing the antibody, such as blood, plasma, 
serum, bone marrow, saliva and urine. 

Also provided is a method of producing the AAVX virus by transducing a cell with the 
nucleic acid encoding the virus. 

The present method further provides a method of delivering an exogenous nucleic acid 
to a cell comprising administering to the cell an AAVX particle containing a vector comprising 
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the nucleic acid inserted between a pair of AAV inverted terminal repeats, thereby delivering 
the nucleic acid to the cell. 

The AAV ITRs in the vector for the herein described delivery methods can be AAVX 
ITRs. Furthermore, the AAV ITRs in the vector for the herein described nucleic acid delivery 
methods can also comprise AAV1, AAV2, AAV3, AAV4, AAV5, AAV6, AAV7, AAV8, 
AAV9, AAV10, AAV1 1, AAAV, or BAAV inverted terminal repeats. 

Also provided is a method of delivering an exogenous nucleic acid to a subject 
comprising administering to a cell of or from the subject an AAVX particle containing a vector 
comprising the nucleic acid inserted between a pair of AAV inverted terminal repeats, and 
returning the cell to the subject, thereby delivering the nucleic acid to the subject. The AAV 
ITRs can be any AAV ITRs, including AAVX ITRs, AAV5 ITRs and AAV2 ITRs. For 
example, in an ex vivo administration, cells are isolated from a subject by standard means 
according to the cell type and placed in appropriate culture medium, again according to cell 
type {see, e.g., ATCC catalog). Viral particles are then contacted with the cells as described 
above, and the virus is allowed to transduce the cells. Cells can then be transplanted back into 
the subject's body, again by means standard for the cell type and tissue (e. g., in general, U.S. 
Patent No. 5,399,346; for neural cells, Dunnett, S.B. and Bjorklund, A., eds., Transplantation: 
Neural Transplantation-A Practical Approach, Oxford University Press, Oxford (1992)). If 
desired, prior to transplantation, the cells can be studied for degree of transduction by the virus, 
by known detection means and as described herein. Cells for ex vivo transduction followed by 
transplantation into a subject can be selected from those listed above, or can be any other 
selected cell. Preferably, a selected cell type is examined for its capability to be transfected by 
AAVX. Preferably, the selected cell will be a cell readily transduced with AAVX particles; 
however, depending upon the application, even pells with relatively low transduction 
efficiencies can be useful, particularly if the cell is from a tissue or organ in which even 
production of a small amount of the protein or antisense RNA encoded by the vector will be 
beneficial to the subject. 

Further provided is a method of delivering an exogenous nucleic acid to a cell in a 
subject comprising administering to the subject an AAVX particle containing a vector 
comprising the nucleic acid inserted between a pair of AAV inverted terminal repeats, thereby 
delivering the nucleic acid to a cell in the subject. Administration can be an ex vivo 
administration directly to a cell removed from a subject, such as any of the cells listed above, 
followed by replacement of the cell back into the subject, or administration can be in 
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vivo administration to a cell in the subject. For ex vivo administration, cells are isolated from a 
subject by standard means according to the cell type and placed in appropriate culture medium, 
again according to cell type {see, e.g., ATCC catalog). Viral particles are then contacted with 
the cells as described above, and the virus is allowed to transfect the cells. Cells can then be 
transplanted back into the subject's body, again by means standard for the cell type and tissue 
(e. g., for neural cells, Dunnett, S.B. and Bjorklund, A., eds., Transplantation: Neural 
Transplantation-A Practical Approach, Oxford University Press, Oxford (1992)). If desired, 
prior to transplantation, the cells can be studied for degree of transfection by the virus, by 
known detection means and as described herein. 

Further provided is a method of delivering a nucleic acid to a cell in a subject having 
neutralizing antibodies to AAV1, AAV2, AAV3, AAV4, AAV5, AAV6, AAV7, AAV8, 
AAV9, AAV 10, AAV1 1, AAAV, or BAAV comprising administering to the subject an AAVX 
particle containing a vector comprising the nucleic acid, thereby delivering the nucleic acid to 
a cell in the subject. A subject that has neutralizing antibodies to AAV1, AAV2, AAV3, 
AAV4, AAV5, AAV6, AAV7, AAV8, AAV9, AAV10, AAV1 1 , AAAV, or BAAV can 
readily be determined by any of several known means, such as contacting AAV1, AAV2, 
AAV3, AAV4, AAV5, AAV6, AAV7, AAV8, AAV9, AAV10, AAV1 1, AAAV, or BAAV 
protein(s) with an antibody-containing sample, such as blood, from a subject and detecting an 
antigen-antibody reaction in the sample. Delivery of the AAV1, AAV2, AAV3, AAV4, 
AAV5, AAV6, AAV7, AAV8, AAV9, AAV10, AAV1 1, AAAV, or BAAV particle can be by 
either ex vivo or in vivo administration as herein described. Thus, a subject who might have an 
adverse immunogenic reaction to a vector administered in an AAV1, AAV2, AAV3, AAV4, 
AAV5, AAV6, AAV7, AAV8, AAV9, AAV10, AAV1 1, AAAV, or BAAV viral particle can 
have a desired nucleic acid delivered using an AAVX particle. This delivery system can be 
particularly useful for subjects who have received therapy utilizing AAV1, AAV2, AAV3, 
AAV4, AAV5, AAV6, AAV7, AAV8, AAV9, AAV10, AAV1 1, AAAV, or BAAV particles 
in the past and have developed antibodies to AAV1, AAV2, AAV3, AAV4, AAV5, AAV6, 
AAV7, AAV8, AAV9, AAV10, AAV1 1, AAAV, or BAAV. An AAVX regimen can now be 
substituted to deliver the desired nucleic acid. 

In any of the methods of delivering exogenous nucleic acids to a cell or subject 
described herein, the AAVX-conjugated nucleic acid or AAVX particle-conjugated nucleic 
acids described herein can be used. 
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In vivo administration to a human subject or an animal model can be by any of many 
standard means for administering viruses, depending upon the target organ, tissue or cell. 
Virus particles can be administered orally, parenterally (e.g., intravenously), by intramuscular 
injection, intrarectally, by direct tissue or organ injection, by intraperitoneal injection, 
topically, transdermal ly, via aerosol delivery, via the mucosa or the like. Viral nucleic acids 
(non-encapsidated) can also be administered, e.g., as a complex with cationic liposomes, or 
encapsulated in anionic liposomes. The present compositions can include various amounts of 
the selected viral particle or non-encapsidated viral nucleic acid in combination with a 
pharmaceutically acceptable carrier and, in addition, if desired, may include other medicinal 
agents, pharmaceutical agents, carriers, adjuvants, diluents, etc. Parental administration, if 
used, is generally characterized by injection. Injectables can be prepared in conventional 
forms, either as liquid solutions or suspensions, solid forms suitable for solution or suspension 
in liquid prior to injection, or as emulsions. Dosages will depend upon the mode of 
administration, the disease or condition to be treated, and the individual subject's condition, but 
will be that dosage typical for and used in administration of other AAV vectors, such as AAV2 
vectors. Often a single dose can be sufficient; however, the dose can be repeated if desirable. 
Administration methods for gene delivery to the cochlea are routine and are described in Jero, 
J. et al. (Gene Ther. 2001 Mar 20;12(5):539-48) and Staecker H, et al. (Acta Otolaryngol. 2001 
Jan; 12 1(2): 157-63), both references herein incorporated by reference for these methods. 

Administration methods can be used to treat brain disorders such as Parkinson's 
disease, Alzheimer's disease, and demyelination disease. Other diseases that can be treated by 
these methods include metabolic disorders such as musculoskeletal diseases, cardiovascular 
disease, cancer, and autoimmune disorders. 

Administration of this recombinant AAVX virion to the cell can be accomplished by 
any means, including simply contacting the particle, optionally contained in a desired liquid 
such as tissue culture medium, or a buffered saline solution, with the cells. The virion can be 
allowed to remain in contact with the cells for any desired length of time, and typically the 
virion is administered and allowed to remain indefinitely. For such in vitro methods, the virion 
can be administered to the cell by standard viral transduction methods, as known in the art and 
as exemplified herein. Titers of virus to administer can vary, particularly depending, upon the 
cell type, but will be typical of that used for AAV transduction in general which is well known 
in the art. Additionally the titers used to transduce the particular cells in the present examples 
can be utilized. 



53 



WO 2006/119432 PCT/US2006/017157 

The cells that can be transduced by the present recombinant AAVX virion can include 
any desired cell, such as the following cells and cells derived from the following tissues, 
human as well as other mammalian tissues, such as primate, horse, sheep, goat, pig, dog, rat, 
and mouse and avian species: Adipocytes, Adenocyte, Adrenal cortex, Amnion, Aorta, Ascites, 
Astrocyte, Bladder, Bone, Bone marrow, Brain, Breast, Bronchus, Cardiac muscle, Cecum, 
Cervix, Chorion, Cochlear, Colon, Conjunctiva, Connective tissue, Cornea, Dermis, 
Duodenum, Embryonic stem cells, Endometrium, Endothelium, Endothelial cells, Epithelial 
tissue, Epithelial cells, Epidermis, Esophagus, Eye, Fascia, Fibroblasts, Foreskin, Gastric, 
Glial cells, Glioblast, Gonad, Hepatic cells, Histocyte, Hair cells in the inner ear, auditory 
(organ of Corti) sensory epithelia, vestibular sensory epithelia, Ileum, Intestine, small Intestine, 
Jejunum, Keratinocytes, Kidney, Larynx, Leukocytes, Lipocyte, Liver, Lung, Lymph node, 
Lymphoblast, Lymphocytes, Macrophages, Mammary alveolar nodule, Mammary gland, 
Mastocyte, Maxilla, Melanocytes, Mesenchymal, Monocytes, Mouth, Myelin, Myoblasts 
Nervous tissue, Neuroblast, Neurons, Neuroglia, Osteoblasts, Osteogenic cells, Ovary, Palate, 
Pancreas, Papilloma, Peritoneum, Pituicytes, Pharynx, Placenta, Plasma cells, Pleura, Prostate, 
Rectum, Salivary gland, Skeletal muscle, Skin, Smooth muscle, Somatic, Spleen, Squamous, 
Stem cells, Stomach, Submandibular gland, Submaxillary gland, Synoviocytes, Testis, 
Thymus, Thyroid, Trabeculae, Trachea, Turbinate, Umbilical cord, Ureter, Uterus, and 
vestibular hair cells. 

For example, provided herein is a method of transducing a cancer cell (e.g., lung cancer 
cell, non-small cell lung cancer cell), colon cell, CNS derived cell, ovarian cell, prostate cell, 
breast derived cell, cervical cord cell, kidney cell, salivary gland cell, or muscle cell using an 
AAV particle disclosed herein. 

Also provided herein is a method of delivering a nucleic acid to a cancer cell (e.g., lung 
cancer cell, non-small cell lung cancer cell), colon cell, CNS derived cell, ovarian cell, prostate 
cell, breast derived cell, cervical cord cell, kidney cell, salivary gland cell, or muscle cell 
comprising administering to the cell an AAV-X1, AAV-Xlb, AAV-X5, AAV-X19, AAV- 
X21, AAV-X22, AAV-X23, AAV-X24, AAV-X25, or AAV-X26 particle containing a vector 
comprising the nucleic acid inserted between a pair of AAV inverted terminal repeats, thereby 
delivering the nucleic acid to the cell. 

The cell of the provided methods can be an inner ear epithelial cell. Thus, the cell of 
the provided method can be an inner ear hair cell. The cell of the provided methods can be an 
inner or outer hair cell of the organ of Corti or a vestibular hair cell. The cell of the provided 
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methods can be an inner ear supporting cell such as Hensen's, phalangal, interdental, or 
vestibular supporting cells. 

The cell of the provided method can be an airway epithelial cell. The cell of the 
provided method can be a columnar, goblet or basal cell. 

The cell of the provided method can be a cell of the submandibular gland. The cell of 
the provided method can be a ductal or acinar cell 

Provided are recombinant vectors based on AAVX. Such vectors may be useful for 
transducing erythroid progenitor cells or cells resistant to transduction by other serotypes of 
AAV. These vectors may also be useful for transducing cells with a nucleic acid of interest in 
order to produce cell lines that could be used to screen for agents that interact with the gene 
product of the nucleic acid of interest. In addition to transduction of other cell types, 
transduction of erythroid cells would be useful for the treatment of cancer and genetic diseases 
which can be corrected by bone marrow transplants using matched donors. Some examples of 
this type of treatment include, but are not limited to, the introduction of a therapeutic gene such 
as genes encoding interferons, interleukins, tumor necrosis factors, adenosine deaminase, 
cellular growth factors such as lymphokines, blood coagulation factors such as factor VIII and 
IX, cholesterol metabolism uptake and transport protein such as EpoE and LDL receptor, and 
antisense sequences to inhibit viral replication of, for example, hepatitis or HIV. 

Provided is a vector, comprising the AAVX virus as well as AAVX viral particles. 
While AAVX is similar to AAV1-1 1, the viruses are found herein to be physically and 
genetically distinct. These differences endow AAVX with some unique advantages, which 
better suit it as a vector for gene therapy. 

Furthermore, as shown herein, AAVX capsid protein is distinct from AAV1, AAV2, 
AAV3, AAV4, AAV5, AAV6, AAV7, AAV8, AAV9, AAV10, AAV1 1, AAAV, or BAAV 
capsid proteins and exhibits different tissue tropism. AAV1, AAV2, AAV3, AAV4, AAV5, 
AAV6, AAV7, AAV8, AAV9, AAV10, AAV1 1, AAAV, BAAV, and AAVX likely utilize 
distinct cellular receptors. AAV1, AAV2, AAV3, AAV4, AAV5, AAV6, AAV7, AAV8, 
AAV9, AAV10, AAV1 1, AAAV, BAAV, and AAVX are serologically distinct and humans 
are not reported to have neutralizing antibodies to AAVX, thus in a gene therapy or gene 
transfer application, AAVX would allow for transduction of a patient who already possess 
neutralizing antibodies to AAV1, AAV2, AAV3, AAV4, AAV5, AAV6, AAV7, AAV8, 
AAV9, AAV10, AAV1 1, AAAV, or BAAV either as a result of natural immunological 
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defense or from prior exposure to AAV1, AAV2, AAV3, AAV4, AAV5, AAV6, AAV7, 
AAVS, AAV9, AAV10, AAV1 1, AAAV, or BAAV vectors. 

Each of the herein disclosed AAVX vectors, or vectors comprising AAVX capsid 
proteins, can have unique or distinct tissue tropism. As shown in Figure 1 1, transduction 
profiles of AAV-X1, AAV-X5, AAV-X25 and AAV-X26 were different from that of known 
AAVs. AAVX1 and AAVX25 utilize a 2-3 link sialic acid as a cell attachment factor, whereas 
AAVX26 does not utilize sialic acid in the transduction process. Further, AAV 12 (AAVX26) 
can be used to transduce cells lacking cell surface heparan sulfate or glycosphingolipids. 
Instead, mannosamine can be part of the AAV12 receptor or attachment. 

As disclosed herein, cells that can be transduced by AAV12 include, but are not limited 
to, non-small cell lung cancer cells (e.g., NSCLC), (e.g., A549, EKVX, NCI-H226), colon cells 
(e.g., HCT-15), CNS derived cells (e.g., SF-268 and SF295), ovarian cells (e.g., IGROV1), 
prostate cells (e.g., PC-3), breast derived cells (e.g., T-47D), kidney cells (e.g., 293T), salivary 
glands (e.g., ductal cells), muscle cells. 

Further, many of the AAV contaminated simian adenoviruses were originally isolated 
from pooled primary kidney cell cultures (Hull et al. 1957, 1956, 1958) originally established 
as part of a vaccine development program (VR-195, VR-197, VR-198, VR-200, VR-202, VR- 
209 and VR-353), but also from rectal swabs (VR-204), cervical cord (VR-355) and CNS 
cultures (VR-207) demonstrating that the tropism of the AAVs isolated from these adenovirus 
stocks include cells of kidney, cervical cord and CNS origin. 

Thus, provided herein is a method of transducing a cancer cell (e.g., lung cancer cell, 
non-small cell lung cancer cell), colon cell, CNS derived cell, ovarian cell, prostate cell, breast 
derived cell, cervical cord cell, kidney cell, salivary gland cell, or muscle cell using an AAV 
particle disclosed herein. 

Thus, provided herein is a method of delivering a nucleic acid to a cancer cell (e.g., 
lung cancer cell, non-small cell lung cancer cell) comprising administering to the cell an 
AAV 12 particle containing a vector comprising the nucleic acid inserted between a pair of 
AAV inverted terminal repeats, thereby delivering the nucleic acid to the cell. 

Thus, provided herein is a method of delivering a nucleic acid to a colon cell 
comprising administering to the cell an AAV 12 particle containing a vector comprising the 
nucleic acid inserted between a pair of AAV inverted terminal repeats, thereby delivering the 
nucleic acid to the cell. 
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Thus, provided herein is a method of delivering a nucleic acid to a CNS derived cell 
comprising administering to the cell an AAV 12 particle containing a vector comprising the 
nucleic acid inserted between a pair of AAV inverted terminal repeats, thereby delivering the 
nucleic acid to the cell. 

Thus, provided herein is a method of delivering a nucleic acid to an ovarian cell 
comprising administering to the cell an AAV12 particle containing a vector comprising the 
nucleic acid inserted between a pair of AAV inverted terminal repeats, thereby delivering the 
nucleic acid to the cell. 

Thus, provided herein is a method of delivering a nucleic acid to a prostate cell 
comprising administering to the cell an AAV 12 particle containing a vector comprising the 
nucleic acid inserted between a pair of AAV inverted terminal repeats, thereby delivering the 
nucleic acid to the cell. 

Thus, provided herein is a method of delivering a nucleic acid to a breast derived cell 
comprising administering to the cell an AAV12 particle containing a vector comprising the 
nucleic acid inserted between a pair of AAV inverted terminal repeats, thereby delivering the 
nucleic acid to the cell. 

Thus, provided herein is a method of delivering a nucleic acid to a cervical cord cell 
comprising administering to the cell an AAV12 particle containing a vector comprising the 
nucleic acid inserted between a pair of AAV inverted terminal repeats, thereby delivering the 
nucleic acid to the cell. 

Thus, provided herein is a method of delivering a nucleic acid to a kidney cell 
comprising administering to the cell an AAV-X1, AAV-Xlb, AAV-X5, AAV-X19, AAV- 
X21, AAV-X22, AAV-X23, AAV-X24, AAV-X25, or AAV-X26 particle containing a vector 
comprising the nucleic acid inserted between a pair of AAV inverted terminal repeats, thereby 
delivering the nucleic acid to the cell. 

Thus, provided herein is a method of delivering a nucleic acid to a salivary gland cell 
comprising administering to the cell an AAV12 particle containing a vector comprising the 
nucleic acid inserted between a pair of AAV inverted terminal repeats, thereby delivering the 
nucleic acid to the cell. 

Thus, provided herein is a method of delivering a nucleic acid to a muscle cell 
comprising administering to the cell an AAV 12 particle containing a vector comprising the 
nucleic acid inserted between a pair of AAV inverted terminal repeats, thereby delivering the 
nucleic acid to the cell. 
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VECTOR SYSTEM 

Provided herein is a vector system for producing infectious virus particles having a 
characteristic of AAVX. As used herein, a "vector system" is a combination of one or more 
vectors that, when added to an appropriate cell system, can produce a recombinant AAVX 
virion, as provided herein. Thus, the provided vector system comprises at least one vector 
comprising a nucleic acid selected from the group consisting of a pair of AAV ITRs, a nucleic 
acid encoding an AAV capsid protein, and a nucleic acid encoding an AAV Rep protein. In 
addition, it is understood that an AAV vector system comprises at least one adenovirus helper 
plasmid. 

The vector system can comprise one or more unique AAV vectors. Thus, the vector 
system can comprise, for example, 1,2,3, 4, 5, or 6 unique AAV vectors. In a three-vector 
system, the first AAV vector can comprise a nucleic acid encoding an AAV capsid protein, the 
second AAV vector can comprise a nucleic acid encoding an AAV Rep protein, and the third 
AAV vector can comprise a pair of AAV ITRs (Table 3). It is understood that Rep and Cap 
sequences can be efficiently combined in the same vector. Thus, in an example of a two- vector 
vector system, the first AAV vector can comprise a nucleic acid encoding an AAV capsid 
protein and a nucleic acid encoding an AAV Rep protein and the second AAV vector can 
comprise a pair of AAV ITRs (Table 3). It is understood that at least one AAV vector in the 
provided AAVX vector system comprises an AAVX capsid, Rep or ITR (Table 3). 

Thus, provided is an AAV vector system, wherein the first vector can comprise a 
nucleic acid encoding an AAVX capsid protein and the second vector can comprise a pair of 
AAV ITRs. The AAV ITRs of the second vector can be a pair of AAV1 ITRs. The AAV 
inverted terminal repeats can be a pair of AAV2 ITRs. The AAV ITRs can be a pair of AAV3 
ITRs. The AAV ITRs can be a pair of AAV4 ITRs. The AAV ITRs can be a pair of AAV5 
ITRs. The AAV ITRs can be a pair of AAV6 ITRs. The AAV ITRs can be a pair of AAV7 
ITRs. The AAV ITRs can be a pair of AAV8 ITRs. The AAV ITRs can be a pair of AAV9 
ITRs. The AAV ITRs can be a pair of AAV10 ITRs. The AAV ITRs can be a pair of AAV1 1 
ITRs. The AAV ITRs can be a pair of AAAV ITRs. The AAV ITRs can be a pair of BAAV 
ITRs. The AAV ITRs can be a pair of AAVX ITRs. 

Also provided is an AAV vector system, wherein the first vector can comprise a nucleic 
acid encoding an AAV capsid protein and the second vector can comprise a pair of AAVX 
ITRs. The capsid protein can be an AAV1 capsid protein. The capsid protein can be an AAV2 
capsid protein. The capsid protein can be an AAV3 capsid protein. The capsid protein can be 
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an AAV4 capsid protein. The capsid protein can be an AAV 5 capsid protein. The capsid 
protein can be an AAV6 capsid protein. The capsid protein can be an AAAV Rep protein. The 
capsid protein can be a BAAV Rep protein. The capsid protein can be an AAVX Rep protein. 

In either of the above vector systems, the first vector or a third vector can further 
comprise a nucleic acid encoding an AAV Rep protein. The AAV Rep protein can be AAV1 
Rep protein. The AAV Rep protein can be AA V2 Rep protein. The AAV Rep protein can be 
AAV3 Rep protein. The AAV Rep protein can be AAV4 Rep protein. The AAV Rep protein 
can be AAV5 Rep protein. The AAV Rep protein can be AAV6 Rep protein. The AAV Rep 
protein can be AAV7 Rep protein. The AAV Rep protein can be AAV8 Rep protein. The AAV 
Rep protein can be AAV9 Rep protein. The AAV Rep protein can be AAV 10 Rep protein. The 
AAV Rep protein can be AAV1 1 Rep protein. The AAV Rep protein can be AAAV Rep 
protein. The AAV Rep protein can be BAAV Rep protein. The AAV Rep protein can be 
AAVX Rep protein. 

Also provided is an AAV vector system, wherein the first vector can comprise a nucleic 
acid encoding an AAVX Rep protein and the second vector can comprise a pair of AAV ITRs. 
The AAV ITRs of the second vector can be a pair of AAV 1 ITRs. The AAV inverted terminal 
repeats can be a pair of AAV2 ITRs. The AAV ITRs can be a pair of AAV3 ITRs. The AAV 
ITRs can be a pair of AAV4 ITRs. The AAV ITRs can be a pair of AAV5 ITRs. The AAV 
ITRs can be a pair of AAV6 ITRs. The AAV ITRs can be a pair of AAV7 ITRs. The AAV 
ITRs can be a pair of AAV8 ITRs. The AAV ITRs can be a pair of AAV9 ITRs. The AAV 
ITRs can be a pair of AAV10 ITRs. The AAV ITRs can be apair of AAV11 ITRs. The AAV 
ITRs can be a pair of AAAV ITRs. The AAV ITRs can be a pair of BAAV ITRs. The AAV 
ITRs can be a pair of AAVX ITRs. 

The first vector or a third vector can further comprise a nucleic acid encoding an AAV 
Capsid protein. The capsid protein can be an AAV1 capsid protein. The capsid protein can be 
an AAV2 capsid protein. The capsid protein can be an AAV3 capsid protein. The capsid 
protein can be an AAV4 capsid protein. The capsid protein can be an AAV5 capsid protein. 
The capsid protein can be an AAV6 capsid protein. The capsid protein can be an AAAV Rep 
protein. The capsid protein can be a BAAV Rep protein. The capsid protein can be an AAVX 
Rep protein. 1 

0 

Table 3: AAVX Vector Systems 

First Vector Second Vector Third Vector 

AAVX capsid + AAV Rep AAVITR 
AAV capsid + AAV Rep AAVX ITR 

AAV capsid + AAVX Rep AAVITR 
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AAVX capsid AAV ITR AAV Rep 

AAV capsid AAVX ITR AAV Rep 

AAV capsid , AAV ITR AAVX Rep 

"AAV" includes: AAV1, AAV2, AAV3, AAV4, AAV5, AAV6, AAV7, AAV8, 
AAV9, AAV10, AAV1 1, AAAV, BAAV, AAV-X1, AAV-Xlb, AAV-X5, AAV- 
X19, AAV-X21, AAV-X22, AAV-X23, AAV-X24, AAV-X25, or AAV-X26. 
"AAVX" includes: AAV-X1, AAV-Xlb, AAV-X5, AAV-X 1 9, AAV-X21, AAV- 
X22, AAV-X23, AAV-X24, AAV-X25, or AAV-X26. 

In either of the above vector systems, the second vector comprising a pair of AAV ITRs 
can further comprise a promoter between the ITRs. The promoter can be AAV2 p5 promoter. 
The promoter can be AAV5 p5 promoter. The promoter can be AAVX p5 promoter. 
Furthermore, smaller fragments of p5 promoter that retain promoter activity can readily be 
determined by standard procedures including, for example, constructing a series of deletions in 
the p5 promoter, linking the deletion to a reporter gene, and determining whether the reporter 
gene is expressed, Le. $ transcribed and/or translated. The promoter can be the AAVX pl9 
promoter. The promoter can be the AAVX p40 promoter. The promoter can be a promoter of 
any of the AAV serotypes. The promoter can be a constitutive promoter. Thus, the promoter 
can be CMV. The promoter can be RSV. The promoter can be LTR. The promoter can be eFl . 
The promoter can be beta actin promoter. The promoter can be a tissue specific promoter. The 
promoter can be an inducible promoter. The promoter can further be functionally linked to an 
exogenous nucleic acid. 

Further provided is any of the disclosed vectors of the vector systems encapsidated into 
an AAV particle. The AAV particle can be an AAV1 virus particle comprising at least one 
AAV1 capsid protein. The AAV particle can be an AAV2 virus particle comprising at least one 
AAV2 capsid protein. The AAV particle can be an AAV3 virus particle comprising at least one 
AAV3 capsid protein. The AAV particle can be an AAV4 virus particle comprising at least one 
AAV4 capsid protein. The AAV particle can be an AAV5 virus particle comprising at least one 
AAV5 capsid protein. The AAV particle can be an AAV6 virus particle comprising at least one 
AAV6 capsid protein. The AAV particle can be an AAV7 virus particle comprising at least one 
AAV7 capsid protein. The AAV particle can be an AAV8 virus particle comprising at least one 
AAV8 capsid protein. The AAV particle can be an AAV9 virus particle comprising at least one 
AAV9 capsid protein. The AAV particle can be an AAV 10 virus particle comprising at least 
one AAV10 capsid protein. The AAV particle can be an AAV1 1 virus particle comprising at 
least one AAV1 1 capsid protein. The AAV particle can be an AAAV virus particle comprising 
at least one AAAV capsid protein. The AAV particle can be a BAAV virus particle comprising 
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at least one B AA V capsid protein. . The AAV particle can be an AAVX virus particle 
comprising at least one AAVX capsid protein. The AAV particle can be a chimeric capsid 
virus particle (described above) comprising a capsid protein from more than one serotype of 
AAV. 

Examples 

It is understood that the disclosed method and compositions are not limited to the 
particular methodology, protocols, and reagents described as these may vary. It is also to be 
understood that the terminology used herein is for the purpose of describing particular 
embodiments only, and is not intended to limit the scope of the present invention which will be 
limited only by the appended claims. 

Example 1 : Identification and Characterization of Novel AAV Isolates in ATCC 
Virus Stocks 

Materials and Methods 

Cell culture and virus propagation: 293T and COS cells were maintained in DMEM, 
supplemented with 10% FBS, 2mM L-glutamine, lOOU/ml penicillin, and O.lmg/ml 
streptomycin or RPMI with 5% FBS. SF295, MCF7, EKVX, Igrovl, CAKI, Ovcar 5 cells were 
cultured in RPMI supplemented with 5% FBS, 2mM L-glutamine, lOOU/ml penicillin, and 
O.lmg/ml streptomycin. Cells were maintained at 37°C in a 5% CO2 humidified atmosphere. 

Screening for novel AAVs: 109 virus samples, obtained from the ATCC (American 
Type Culture Collection) as lysate of infected cells, were analyzed for the presence of AAV 
DNA by PCR as described earlier (Katano, H., et al. 2004. Biotechniques 36:676-80). Briefly, 
low molecular weight DNA was purified with High Pure Viral Nucleic Acid Kit (Roche). 
These DNA samples were assayed for AAV contamination by PCR using the GC Rich PCR 
Kit (Roche). This method detects the presence of AAV DNA by PCR using degenerative PCR 
primers, which were shown to amplify a fragment containing sequences of the rep and vp ORF 
of all known AAV serotypes. PCR using DNA isolated from ATCC VR-195, VR-197, VR- 
198, VR-200, VR-202, VR-204, VR-207, VR-209, VR-353, VR-355, VR-942, VR-943 as 
template resulted in the generation of a 1.4kb amplification product, which was subsequently 
cloned using the TOPO TA Cloning KIT (Invitrogen) and sequenced with an ABI Prism 3100 
Genetic Analyzer (ABI) and FS dye-terminator chemistry (ABI). The obtained sequences 
showed homology to AAV6 rep ORF and cap ORF but were not identical to any known AAV. 
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Viral DNA isolation cloning and sequencing: The rep and cap ORF of the new AAVs 
was PCR amplified and subcloned. Viral DNA that was isolated from VR-195, VR-197, VR- 
198, VR-200, VR-202, VR-204, VR-207, VR-209, VR-353, VR-355, VR-942, VR-943 was 
PCR amplified with primers: 

AAV1-4 225(+): GCGACAKTTTGCGACACCAYGTGG (SEQ ID NO:31) and 

UNI-NC: CCANNNGGAATCGCAATGCCAAT (SEQ ID NO:32), or 

UNIC: ATGNTNATNTGGTGGGAGGAGGG (SEQ ID NO:33) and 

AAV1-4 polyA4400(-): CGAATNAAMCGGTTTATTGATTAAC (SEQ ID NO:34), or 

AAV 4500 (+): CAATAAACCGkkTnATTCGTkTCAGT (SEQ ID NO:38) and 

AAV 450 (-): ACANNWGAGTC AGAAATKCCNGGCAG (SEQ ID NO:39) 

(N can be A, C, G, or T; K can be G or T; Y can be C or T; M can be A or C) 

to amplify the Rep ORF, capsid ORF, and ITR(3 5 -AAVX terminus, ITR and 3'- 
terminus of circular or concatermerized AAVs), respectively. The PCR products were 
subcloned using the TOPO TA Cloning KIT (Invitrogen) and at least three clones of each 
isolate* were sequenced with an ABI Prism 3 100 Genetic Analyzer (ABI) and FS dye- 
terminator chemistry (ABI). The obtained sequences showed homology to AAV1 and AAV6 
but were not identical to any known AAV. As shown in Table 5, there are three naming 
conventions used herein. For example, the newly identified AAVs were named according to 
the ATCC adenovirus strains from which they were discovered (e.g., AAV(VR-943)). Also, 
each unique AAV isolate was assigned a temporary identifier (e.g., AAV-X26). Finally, . . . 
(e.g., AAV-12). 

Identical AAV sequences were detected in VR-195, VR-197, VR-198 and VR-202 and 
named AAV-X1. VR-195 also contained a homolog of AAV-X1 termed AAV-Xlb. 

AAV detected in VR-200 was named AAV-X19. AAV detected in VR-204 was named 
AAV-X21 . AAV detected in VR-207 was named AAV-X22. AAV detected in VR-209 was 
named AAV-X23. AAV detected in VR-353 was named AAV-X24. AAV detected in VR-355 
was named AAV-X25. AAV detected in VR-942 was named AAV-X5. AAV detected in VR- 
943 was named AAV-X26. (Table 5). 

Sequence analysis: DNA and protein sequence alignments were performed using the 
Clustal W multiple sequence alignment tool of the Biology Workbench web based software 
(SDSC), MacVector 7 (Oxford Molecular). The sequences amplified from ATCC virus stocks 
were compared to sequences in GenBank using BLAST (Basic Local Alignment Search Tool 
at http://www.ncbi.nlm.nih.gov/BLAST/). DNA alignments were performed using the 
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ClustalW multiple sequence alignment tool of the Biology Workbench web based software at 
http://seqtool.sdsc.edu (SDSC) and MacVector 7 (Accelrys, Burlington, MA). Divergent amino 
acids in VP1 were mapped on the capsid by superimposing the VP1 sequence onto a 
pseudoatomic structure for AAV-6. The AAV6 crystal structure was predicted by remodeling 
the AAV2 capsid structure, obtained from the Virus Particle Explorer, at the Swiss-Model 
server with the AAV6 capsid sequence. 

AAV-X17 partial ITR sequence was 98% identical to corresponding AAV2 ITR 
sequence. AAV-X22 partial ITR sequence was 99% identical to corresponding AAV2 ITR 
sequence. AAV-X25 partial ITR sequence was 96% identical to corresponding AAV1 ITR 
sequence. AAV-X26 was only about 80% identical to corresponding AAV2 ITR sequence. 
However, the TRS signal (e.g., aa 176-181 of SEQ ID NO:40) and Rep Binding site (e.g., aa 
195-210 of SEQ ID NO:40) were conserved in AAV-X26 ITR as compared to the AAV2 ITR. 

Generation of recombinant virus: Recombinant AAV-X1, AAV-Xlb, AAV-X19, 
AAV-X21, AAV-X22, AAV-X23, AAV-X24, AAV-X25, AAV-X5, and AAV-X26 were 
generated by transfecting 293 T cells with AAV2 vector plasmid (AAV2-NLS-GFP) consisting 
of an GFP expression cassette flanked by AAV-2 ITRs and AAV-X1, AAV-Xlb, AAV-X19, 
AAV-X21, AAV-X22, AAV-X23, AAV-X24, AAV-X25, AAV-X5, or AAV-X26 packaging 
(subcloned UNI-C/ AAV 1-4 polyA4400(-) PCR fragment), an AAV-2 like Rep expression 
plasmid, AAV-X26_Rep, (subcloned AAV-X26 Rep ORE, AAV1-4 225(+) and UNI-NC 
PCR fragment), and Ad helper plasmids, 449B, which provided the essential adenovirus 
functions that are required for AAV replication (Smith, R. H., et al. (2002). Biotechniques 
33(1): 204-6, 208, 210-1). Two confluent T175 flasks of 293T cells were harvested, 
resuspended in 100ml DMEM 10%FCS, seeded in five 150mm plates and incubated at 37 °C, 
5% C0 2 until cells are 80% confluent (typically 48h). Cells were transfected with 15ng 
pAAV2-NLS-GFP, 10^g AAV-X1, AAV-Xlb, AAV-X19, AAV-X21, AAV-X22, AAV-X23, 
AAV-X24, AAV-X25, AAV-X5, or AAV-X26 packaging plasmid, 5fig AAV-X26_Rep and 
30fig p449B. 48h after transfection, cells were harvested, washed with PBS and resuspended in 
1 1ml TD buffer (0.14 M NaCl, 5.0 mM KC1, 0.7 mM K 2 HP0 4 , 25.0 mM Tris, pH7.4. Cells 
were lyzed by 3 freeze thaw cycles and incubated for 30 minutes at 37°C after adding 
benzonase to a final concentration of 20 U/ml sodium deoxycholate (final concentration of 
0.5%). After adding 0.55g/ml CsCl, the lysate was fractionated using density gradient 
centrifugation in a SW41 rotor for 48h at 38,000 ipm. The gradients were harvested in 0.5 ml 
aliquots. Aliquots were assayed for infectivity and DNase-resistant genome copy titers of the 
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vector preparations were determined by quantitative real-time PCR using the TAQMAN 
system (Applied Biosystems, Inc. (ABI), Foster City, CA) with probes specific to the CMV 
promoter. 

Recombinant virus was successfully generated using each of the AAVX packaging 
plasmids with AAV2 ITR vector plasmids and AAV-X26 rep plasmids. Recombinant virus 
was also successfully generated using Rep plasmids derived from each of the AAVX viruses in 
combination with AAV2 ITR vector plasmids. There is therefore sufficient homology between 
AAVX and AAV2 Rep and ITR sequences that they can be used interchangeably in the herein 
provided vectors and vector systems: 

Neutralization assay: Exponentially growing COS cells were plated at a density of 5 x 
103/well in a flat-bottom 96-well plate. Twenty- four hours after seeding, cells were incubated 
for 60 min with 2 x 106 rAAV particles that were pre-incubated with serial dilutions of pooled 
human IgGs (Immune Globuline Intravenous, 10%, Gamunex, BAYER, Leverkusen, 
Germany). Twenty-four hours after infection, cells were analyzed for GFP expression with the 
Guava PC A-96 (Guava Technologies, Hayward, CA) fluorescent cell counter. GFP expression 
was used as a surrogate marker for transduction efficiency. 

Determination of tissue tropism: Transduction efficiency of recombinant AAV-X1, 
AAV-Xlb, AAV-X19, AAV-X21, AAV-X22, AAV-X23, AAV-X24, AAV-X25, AAV-X5, or 
AAV-X26 vector containing an expression cassette for GFP was analyzed in cancer cell lines. 
Cells were infected with dilutions ranging from 10 6 to 10 9 particles/well 24-48h after 
infection, cells were analyzed for GFP expression by flow cytometry. 

Neuraminidase treatment: To analyze if sialic acid is required for transduction of AAV- 
XI and AAV-X25, neuraminidases were used to digest cell surface sialic acid before infection. 
Cos cells were seeded at 5,000 cells/well in a 96 well plate. 24h after seeding, cells were 
incubated for 60 min with a broad spectrum neuraminidase from Arthrobacter ureafaciens 
(0.05-1 U/ml) or a 2-3 linkage specific neuraminidase from S. pneumoniae (0.04-0.2U/ml) 
(Calbiochem, La Jolla, CA). Cells were washed and transduced with a multiplicity of infection 
(MOI) of 500 with recombinant AAV6, AAV-X1 or AAV-X25 particles expressing GFP. Cells 
were analyzed for GFP expression by flow cytometry with the Guava PCA-96 (Guava 
Technologies) 24 h after transduction. 

Binding assay: COS cells were seeded at 5000 cells/well in a 96-well plate. Forty-eight 
hours after seeding, cells were incubated for 60 min at 37°C with neuraminidases from A. 
ureafaciens (0.1 U/ml), S. pneumoniae (0.08 U/ml) (Calbiochem) or mock. Cells were then 
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chilled for 30 min at 4°C and then incubated for 60 min at 4°C with 5 x 1 0 7 recombinant 
AAV6, AAV(VR-195), or AAV(VR-355) particles expressing GFP. Cells were then washed 
twice with cold medium and phosphate buffered saline (PBS) and lysed in 50^1 PCRnGo 
buffer (Pierce). Copy numbers of cell-associated vector genomes in the cell lysates were 
determined by quantitative real-time PCR using the TAQMAN system (Applied Biosystems) 
with probes specific to the CMV promoter 

Lectin competition: COS cells were seeded at 5,000-10,000 cells/well in a 96 well plate. 
16-24h after seeding, cells were precubated for 30 min at 4°C with 1 00 jig/ml of either ConA, 
Malll, LCA, ECL, WGA, UEA I, DBA, PNA, SBA, GSL I, PSA, LCA, PHA-E, PHA-L, SJA, 
succinylated WGA. Subsequently, cells were washed and transduced for 60 min at 4°C with 
3000 transducing units or 10 8 recombinant AAV-2, AAV-4, AAV-5, AAV-6, BAAV, AAV- 
XI, AAV-X5, AAV-X25 or AAV-X26 particles expressing GFP in 50 jul medium 
supplemented with lectins (100 ^ig/ml). 24h after transduction, cells were analyzed for GFP 
expression by flow cytometry. 

Sugar competition: COS cells were seeded at 5,000 cells/well in a 96 well plate. 24h 
after seeding, cells were precubated for 30 min at 4°C before being transduced for 60 min at 
4°C with 10 8 recombinant AAV-2, AAV-4, AAV-5, AAV-6, BAAV, AAV-X1, AAV-X5, 
AAV-X25 or AAV-X26 particles expressing GFP in 50 nl medium supplemented with various 
sugars (see Fig. 9). 24h after transduction, cells were analyzed for GFP expression by flow 
cytometry. 

Heparin competition assay: COS cells were plated at a density of 5 x 10 3 /well in a flat- 
bottom 96-well plate. After 24 h, 10 7 particles rAAV6-NLS-GFP, rAAV(VR-195)-NLS-GFP 
and rAAV(VR-355)-NLSGFP were preincubated for 1 h at room temperature in medium 
containing 0 to 2000 ng/ml heparin (Sigma, St. Louis, MO). Cells were then transduced for 1 h 
at 37°C with this preincubation mixture, washed with medium and incubated for 24 h. GFP 
expression was detected with a fluorescent cell counter (Guava Technologies). 

NaCl competition assay: COS cells were seeded at 5,000 cells/well in a 96 well plate. 
24h after seeding, were transduced for 60 min at 37°C with MOI of 500 orlO 8 recombinant 
AAV-2, AAV-4, AAV-5, AAV-6, BAAV, AAV-X1, AAV-X5, AAV-X25 or AAV-X26 
particles expressing GFP in 50 medium supplemented with NaCl to 150mM, 300mM, and 
450mM final NaCl concentration. 24h after transduction, cells were analyzed for GFP 
expression by flow cytometry/ fluorescent cell counter (Guava Technologies). 
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Viral cross-competiiion: COS cells were plated at a density of 5 x 10 3 /well in a flat- 
bottom 96-well plate. After 24 h, cells were preincubated for lh at 37°C with increasing titers 
of rAAV6-lacZ, an AAV6 derived vector expressing a nuclear localized j8-galactosidase, 
ranging from 0 to 2.6x 1 0 9 particles/well. The cells were then washed with medium and 
transduced for 45 min with 2xl0 6 particles of rAAV2-NLS-GFP, rAAV6-NLS-GFP, 
rAAV(VR-195)-NLS-GFP, or rAAV(VR-355)-NLS-GFP. GFP expression was analyzed 48 h 
after transduction by flow cytometry. 

Results 

Identification of AAV contaminations in ATCC virus isolates: Viral stocks, supplied by 
the ATCC were analyzed for the presence of AAV DNA by a PCR based assay as described 
earlier (Katano, H., et ai. 2004). PCR primers in this study were designed to bind to highly 
conserved regions in the rep and cap ORFs resulting in amplification of a 1.5kb fragment 
spanning from nucleotide 1437 to 2904 relative to the AAV-2 genome. This method is highly 
sensitive and can detect as few as 10 copies of viral DNA/|iL of sample (Katano, H., et al. 
2004). AAV DNA was detected in 13/137 samples analyzed (Table 4). AAV contaminations 
were detected in 26 % of adenovirus isolates. Interestingly, no AAV was detected in 
herpesvirus, retrovirus, coronavirus, orthomyxovirus, poxvirus, or reovirus stocks. Many of the 
AAV contaminated simian adenoviruses were originally isolated from pooled primary kidney 
cell cultures (Hull, R. N., et al. 1957; Hull, R. N., et al 1958; Hull, R. N., et al. 1956) 
originally established as part of a vaccine development program (VR-195, VR-197, VR-198, 
VR-200, VR-202, VR-209, and VR-353), but also from rectal swabs (VR-204), cervical cord 
. (VR-355), and CNS cultures (VR-207). AAV was also detected in stocks of human adenovirus 
type 9 (VR-10) as well as in bovine adenovirus type 1 (VR-313) and type 2 (VR-314). Ten of 
theses isolates have high similarity to AAV1 and AAV6 (>98%), while AAV-X5 isolated from 
VR-942 and AAV-X26 isolated from VR-943 showed highest homology in the capsid protein 
VP-1 to AAV-3B (93%) and AAV-1 1 (83%) respectively. 

Construction of packaging plasmids and generation of recombinant virus: The entire 
coding region for Rep and Cap of the AAV contaminations detected in VR-195 and VR-355 
termed AAV(VR-195) and AAV(VR-355), respectively, as well as the capsid ORF under 
control of the viral P40 promoter from the adenovirus isolates VR-10, VR-195, VR-197, VR- 
198, VR-200, VR-202, VR-209, VR-353, VR-204, VR-355, and VR-207 were PCR amplified 
and subcioned to generate packaging plasmids. 
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Table 4. Screening of viral samples for the presence of AAV 



Adenovirida 


53 


13 


26% 


primate Adenovirida 


16- 


13 


81% 


Herpesviridae 


43 


0 


0 


Coronaviridae 


9 


0 


0 


Retrovirida 


15 


0 


0 


Reoviridae 


4 


0 


0 


Orthomyxovirida 


4 


0 


0 



Recombinant AAV was produced by cotransfecting plasmids encoding the capsid of 
the novel AAVs, an AAV type 2 Rep expression plasmid, an AAV-2 vector plasmid, encoding 
a nuclear localized GFP together with an adenovirus helper plasmid. Recombinant viruses 
were then assayed for transduction activity. Capsid plasmids encoding functional VP proteins 
were sequenced on both strands. Several clones of each isolate were analyzed. GenBank 
accession numbers are as follows: AAV(VR-195): DQ180604(SEQ ID NO:l) and AAV(VR- 
355): DQ180605 (SEQ ID NO:9). 

The evolutionary relationship among mammalian AAVs and the AAV contaminants 
detected in human and non-human primate adenovirus stocks was analyzed by ClustalW 
alignments of VP1 amino acid sequences and plotted as a rooted phylogenetic tree (Fig. 1). AH 
AAVs detected in simian adenovirus stocks displayed at least 96% homology on the DNA 
level and 98% identity in the capsid amino acid sequence to either AAV1, AAV6, or to each 
other. The amino acid sequence of VP1 encoded by AAV contaminants in VR-197, VR-198, 
and VR-202 were identical to each other but distinct from AAV1 and AAV6. The VP1 
sequence of the only positive human adenovirus stock, VR-10 (adenovirus type 9), was greater 
than 99.9% identical to AAV2. 

AAV-X1(97%, 99%), AAV-X1B (96%, 98%), AAV-X19 (97%, 99%), AAV- 
X21(97%, 99%), AAV-X22 (97%, 99%), AAV-X23(97%, 98%), AAV-X24(97%, 97%), and 
AAV-X25 (97%, 98%) share 96 - 99% sequence identity to AAV6 or AAV1 at the DNA level 
(based on a 3kb partial genome fragment) (see Figure 1). Numbers in brackets indicate 
identities to AAV6 and AAV1 . The amino acid sequence of AAV-X5 capsid protein shows 
highest homology to AAV-3B (93%), and AAV-X26 capsid protein shows highest homology 
to AAV- 11 (83%). AAV-X5 and AAV-X26 DNA sequences show highest homology to AAV- 
3 (88%) and AAV-1 1 (79%) respectively (based on a 3kb partial genome fragment) (see Figure 
1). 
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Mapping of sequence variation: AAV contaminations, which were greater than 98% 
identical to AAV6 in the amino acid sequence of the capsid protein VP1, were detected in 10 
adenovirus isolates. AAV-X5 isolated from VR-942 and AAV-X26 isolated from VR-943 
showed homology (sequence similarity) in the capsid protein VP1 to AAV-3B (93%) and 
AAV-1 1 (83%), respectively, and only low homology to AAV6 (87% and 61%, respectively). 
AAV-X5 and AAV-X26 are therefore very different from the other new AAVs disclosed 
herein. The capsid protein VP1 of AAV(VR-195) and AAV(VR-355) differs in 7 or 6 amino 
acids, respectively, from that of AAV6 (Table 5). To identify the location of the amino acids 
that are unique for AAV(VR-195) and AAV(VR-355) within the capsid, we superimposed the 
AAV(VR-195) and AAV(VR-355) VP1 sequence onto a pseudoatomic structure for AAV-6. 
Divergent regions in the capsid proteins are located on exposed surface loops at the threefold 
axis of symmetry, an area of the capsid that has been associated with receptor binding (Kem, 
A., et al. 2003; Opie, S. R., et al. 2003). AAV(VR-195) specific amino acids of three different 
VP3 subunits were clustered in close proximity. AAV(VR-355) specific changes relative to 
AAV6 were similarly organized. Since the amino acid changes among AAV(VR-195), 
AAV(VR-355), and AAV6 are surface exposed, we hypothesized they may effect antigenicity 
or cell tropism. 



Table 5. Novel AAVs 



Name 



A.K.A. 



were isolated from ATCC adenovirus samples 
Isolated from Classification Family 



AAV-X1, 
AAV-Xlb 

AAV-X1 

AAV-X1 
AAV-X19 

AAV-X1 

AAV-X21 

AAV-X22 

AAV-X23 

AAV-X24 

AAV-X25 

AAV-X5 

AAV-X26 



AAV(VR-195) VR-195 



AAV(VR-197) 
AAV(VR-198) 
AAV(VR-200) 
AAV(VR-202) 
AAV(VR-204) 
AAV(VR-207) 
AAV(VR-209) 
AAV(VR-353) 
AAV(VR-355) 
AAV(VR-942) 
AAV(VR-943) 
AAV-12 



VR-197 
VR-198 
VR-200 
VR-202 
VR-204 
VR-207 
VR-209 
VR-353 
VR-355 
VR-942 



Simian virus 1 Adenovirida 



Simian virus 15 
Simian virus 17 
Simian virus 23 
Simian virus 27 
Simian virus 3 1 
Simian virus 34 
Simian virus 37 
Simian virus 39 
Simian virus 38 
Simian adenovirus 17 



Adenovirida 
Adenovirida 
Adenovirida 
Adenovirida 
Adenovirida 
Adenovirida 
Adenovirida 
Adenovirida 
Adenovirida 
Adenovirida 



VR-943 Simian adenovirus 18 Adenovirida 



Table 6. Comparison of the amino acid sequence of VP1 
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162 


198 


327 


386 


418 


495 


514 


531 


584 


590 


AAV6 


T 


V 


N 


Q 


D 


35 


R 


K 


L 


ID) 


AAV(VR-195) 


T 


V 


S 


Q . 


E 


G 


H 


E 


F 


H 


AAV(VR-355) 


S 


L 


N 


K 


E 


© 


R 


E 


F 


© 



Numbers on top indicate the amino acid in AAV6 VP1, where either AAV ( VR- 195) or AAV(VR-355) diverges 
from AAV6. Bold are basic amino acids: Outlined are acidic; Italic are polar or slightly acidic; Underlined are 
aromatic. 



Immunological characterization: To test for a difference in the immunological response 
to the isolates, IgGs purified from pooled human serum were assayed to determine if they 
contained neutralizing antibodies against the recombinant AAVs and whether a difference in 
the neutralization activity against either AAV6, AAV(VR-I95) or AAV(VR-355) existed (Fig. 

2) . In this assay, all three viruses displayed similar sensitivity to neutralization with the 
purified pooled IgGs. Thus, the changes in the capsid of AAV(VR-195) or AAV(VR-355) 
compared to AAV6 and do not appear to alter their sensitivity to neutralization by human 
serum. 

Heparin competition: Heparan sulfate, a ubiquitous cell surface glycosaminoglycan, is 
an attachment receptor for AAV2 (Surnmerford, C, et al. 1998). AAV2 transduction can be 
inhibited by heparin, a heparan sulfate analog. Like AAV2, AAV6 can be purified using a 
heparin affinity column; however, transduction is not inhibited by a low heparin concentration 
, indicating that heparan sulfate does not act as an AAV6 receptor (Halbert, C. L., et al 2001). 
To investigate the role of heparan sulfate in AAV(VR-195) or AAV(VR-355) transduction, 
competition transduction experiments were performed with increasing amounts of heparin. At 
low concentrations of heparin (25(ig/ml), AAV2 was the only rAAV that was inhibited (Fig. 

3) . However, at 2000 ng/ml, approximately 50% inhibition of AAV6 and AAV(VR-195) was 
observed. No inhibition was observed with either AAV(VR-355) or AAV5 . Therefore, 
heparan sulfate does not appear to be a co-receptor for either AAV(VR-195) or AAV(VR-355). 

AAV6 and AA V(VR-195) transduction is charge sensitive: Heparin is a highly charged 
molecule. The observed inhibition of AAV6 and AAV(VR-195) at high concentrations of 
heparin could be caused by a charge driven interaction between the viruses and heparin rather 
than a specific interaction. Increasing the ionic strength in the medium during transduction can 
minimize charge dependent interactions (Arnberg, N., et al. 2002). Salt ions can bind to 
charged groups on the cell surface and virus capsid, thereby reducing these electrostatic 
interactions. To analyze if AAV6, AAV(VR-195), and AAV(VR-355) transduction is charge 
dependent, the ionic strength was increased during transduction by adjusting the NaCl 
concentration in the tissue culture medium from 150mM up to 450mM. Transduction of 
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rAAV-6, rAAV(VR-195), and rAAV(VR-355) was inhibited to different amounts by 
increasing concentrations of NaCl (Fig.4). While 250mM NaCl was sufficient to inhibit 
rAAV6 by 50%, higher NaCl concentrations were required to inhibit AAV(VR-195) and 
AAV(VR-355) with IC50 of 350mM and 450mM respectively. The relative dependence on 
charge follows the inhibition of these viruses by heparin and indicates that the inhibition of 
AAV6 and AAV(VR-195) by heparin is charge dependent. 

Effect of neuraminidase treatment on r AAV transduction and vims binding: Dependent 
parvoviruses are also reported to use sialic acid as a co-receptor for binding and transduction 
(Kaludov, N., et al. 2001 ; Seiler, M. P., et al. 2002; Walters, R. W., et al. 2001). To analyze if 
sialic acid is required for transduction with AAV(VR-195) and AAV(VR-355) vectors, the 
effect of the removal of cell surface sialic acids was studied by neuraminidase treatment on 
transduction (Fig. 5A) and virus binding (Fig. 5B). Treating COS cells with a broad -spectrum 
neuraminidase from A. ureafaciens inhibited rAAV6, rAAV(VR-195), and rAAV(VR-355) 
transduction 3-5 fold. Neuraminidase isolated from S. pneumoniae will only remove a 2-3 
linked sialic acid. Treatment of the cells with this enzyme again inhibited transduction with all 
three rAAVs. Removal of cell surface sialic acids with neuraminidases from A. ureafaciens or 
S. pneumoniae also resulted in a greater than 90% reduction in cell binding of rAAV6, 
rAAV(VR-195) and rAAV(VR-355), indicating that these viruses utilize a 2-3 link sialic acid 
as an cell attachment factor. 

AA V6, AA V(VR-195), andAA V(VR~355) can be distinguished based upon lectin 
competition: Lectins are proteins that recognize and bind oligosaccharides conjugated to 
proteins and lipids and can be used to block virus binding (Summerford, C, et al. 1998). Malll, 
a lectin that recognizes a 2- 3 linked sialic acid, inhibited transduction of rAAV-6, rAAV(VR- 
195), and rAAV(VR- 355) (Fig. 6). . 

Many glycoproteins contain a core oligosaccharide structure which includes a-linked 
mannose. To study, if these "high-mannose" proteins play a role in AAV-X1 or AAV-X25 
transduction, these proteins were blocked with ConA, a lectin that recognizes a-linked 
mannose. AAV-6, AAV-X1 or AAV-X25 were blocked equally by ConA, indicating that 
proteins with a mannose core are involved in binding and/ or uptake. Lens Culinaris Agglutinin 
(LCA) recognizes a-linked mannose together with neighboring oligosaccharides. It is less 
broad specific than ConA. LCA inhibited all recombinant viruses tested and had an 
approximately twofold higher inhibitory potential on rAAV6 than on rAAV(VR-195) or 
rAAV(VR355). ECL is a lectin specific toward galactose residues and had the highest binding 
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activity toward galactosyl (P -1,4) N-acetyl glucosamine. ECL blocked AAV-X1 mediated 
transduction less specific than AAV-6 or AAV-X25. While ECL, which recognizes a-mannose 
in conjugation with galactosyl (p -1 ,4) N-acetylglucosamine, did not inhibit rAAV(VR-195), it 
reduced rAAV6 and rAAV(VR-355) transduction 7 and 5-fold, respectively. STL, which 
recognizes Nacetylglucosamine, did not inhibit rAAV(VR-195), but reduced rAAV6 and 
rAAV(VR-355) transduction 30 and 12-fold, respectively. These results indicate that while all 
three viruses bind terminal a 2-3 linked sialic acid, the amino acid changes on the capsid 
surface affected the cell binding activity of each isolate. 

These differences observed between AAV-6, AAV-X1 and AAV-X25 in lectin 
competition experiments and NaCl competition indicates utilization of different receptors or 
differences in receptor interaction that can result in a different cell tropism of vectors based on 
these isolates. The transduction efficiency of AAV-6, AAV-X1 and AAV-X25 based vectors 
were therefore analyzed in human cancer cell lines (Fig.9). Each recombinant virus has a 
unique transduction profile indicating that they bind to different receptors or interact 
differently with a common receptor. 

AAV-X5 isolated from VR-942 and AAV-X26 isolated from VR-943 showed 
homology (sequence similarity) in the capsid protein VP1 to AAV-3B (93%) and AAV-1 1 
(83%), respectively, and only low homology to AAV6 (87% and 61%, respectively). AAV-X5 
and AAV-X26 are therefore very different from the other new AAVs disclosed herein. To 
study the attachment factors that are involved in AAV-X5 and AAV-X26 mediated 
transduction, competition with various carbohydrates was analyzed (Fig. 10). Heparin is a 
known attachment factor/ receptor for AAV2 and addition of Heparin and homolog sugars to 
the medium during transduction resulted in an inhibition of AAV-2 mediated gene transfer. 
Heparin had no effect on AAV-X5 and AAV-X26 mediated gene transfer demonstrating that 
extracellular heparin can not block AAV-X5 and AAV-X26 mediated gene transfer. Heparin 
and related carbohydrates do not appear to have receptor function for these viruses. 

AAV-X5 and AAV-X26 therefore utilize unique uptake pathways/ receptors that are 
different from all other AAVs analyzed thus far. Transduction efficiencies in four cancer cell 
lines were therefore examined (Fig. 11). AAV-X5 and AAV-X26 have unique transduction 
properties and tropism. Thus, the application range and potential use of recombinant vectors 
based on AAV-X5 and AAV-X26 are unique and different from other AAVs described thus 
far. 
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rAA V6, rAA V(VR-195), and rAA V(VR-355) differ in their cell tropism: As a result of 
the few sequence changes on the surface of the rAAV(VR-195) and rAAV(VR-355) capsid, 
each virus appears to exhibit different biological characteristics from each other and from 
rAAV6. To analyze if this difference also results in a change in tropism or transduction 
activity, six human cancer cell lines and African green monkey kidney cells, COS, were 
transduced with recombinant vectors based on rAAV6, rAAV(VR-195), and rAAV(VR-355) 
(Fig. 7). AAV6 and AAV(VR-195) transduced COS cells, the non-small cell lung cancer cell 
line EKVX, ovarian IGROV1, and renal CAKI cells with similar efficiency. However, the 
transduction rates of AAV(VR-195) on ovarian Ovcar5 cells and the CNS derived SF295 cell 
line were approximately 4 times lower compared to AAV6. AAV(VR-355) demonstrated 
efficient gene transfer in COS and EKVX cells, but transduction of Igrovl, CAKI, Ovcar5 and 
SF295 was 10 to 17-fold lower than for AAV6. The different transduction efficiencies of 
rAAV6, rAAV(VR- 1 95), and rAAV(VR-3 5 5) indicate that each isolate may have a distinct 
cell tropism which could be the result of utilization distinct receptors for attachment or 
internalization or have different affinities for a common receptor. 

rAAV6 competition: rAAV6, rAAV(VR-195), and rAAV(VR-355) require (J 2-3 linked 
sialic acid for cell attachment and transduction but differ in their charge dependency, 
sensitivity to lectin competition, and transduction activity on a panel of cells. To investigate if 
these viruses use distinct receptors, competition experiments were used to assay for change in 
transduction between AAV6 and the other isolates. COS cells were pre-incubated with 
increasing doses of rAAV6-lacZ followed by transduction with identical particle titers of either 
rAAV2, rAAV6, rAAV(VR-195), or rAAV(VR-355) expressing GFP. Changes in GFP 
expression were detected by flow cytometry (Fig. 8). rAAV6-LacZ competition had the 
greatest effect on r AA V6-NLS -GFP transduction. Fifty percent inhibition of rAAV6-NLS-GFP 
transduction was observed at approximately 60-fold excess of the competitor, whereas a 220- 
fold excess was required for the same level of inhibition of rAAV(VR-195) or rAAV(VR-355). 
Fifty percent inhibition of rAAV2-NLS-GFP transduction required greater than 250-fold 
particle excess. The stronger inhibition of r AAV6-NLS-GFP by rAAV6-lacZ compared with 
rAAV(VR-195) or rAAV(VR-355) suggests, that while rAAV6, rAAV(VR-195) and 
rAAV(VR-355) share a common attachment factor and potentially a common receptors, 
differences in the attachment factor and receptor interaction exist. 

Example 2: Identification and Characterization of AAV12 
Materials And Methods 
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Cells culture and virus: African green monkey kidney COS cells, obtained from the 
American Type Culture Collection (ATCC) (Manassas, VA), were cultured in RPMI-1640 
medium (Biosource, Camarillo, CA), supplemented with 5% fetal bovine serum (Hyclone, 
Logan, UT), 2 mM L-glutamine, 100 U of penicillin/ml, and 0.1 mg of streptomycin/ml 
(Invitrogen, Carlsbad, CA). Cells were maintained at 37°C under a 5% C0 2 humidified 
atmosphere. Simian Virus 18 isolate, VR-943, was obtained from the ATCC as crude lysate of 
virus-infected cells. 

Subcloning of theAAV12 rep and cap gene: The complete coding region of AAV12 rep 
and cap were PCR amplified and subcloned. DNA was isolated from lysate of Simian Virus 18 
infected cells with the QIAprep Spin Miniprep kit (Qiagen, Valencia, CA).The rep open 
reading frame (ORF) was PCR amplified from this DNA with the primers 
AAV225(+):GCGACAKTTTGCGACACCAYGTGG (SEQ ED NO:44) andUM-NC: 
CCANNNGGAATCGCAATGCCAAT (SEQ ID NO:45). AAV12 cap was amplified with the 
primers UNI-C: 5 '-ATGNTNATNTGGTGGGAGGAGGG-3 ' (SEQ ID NO:46) and AAV1-4 
polyA4400(-):5 '-CGAATOAAMCGGTTTATTGATTAAC-3 ' (SEQ ID NO:47). The PCR 
fragments were subcloned using the TOPO TA Cloning KIT (Invitrogen) resulting in the 
plasmids pAAV12-Rep andpAAV12-Cap. Three clones, that were capable of generating 
recombinant virus, were sequenced with an ABI Prism 3100 Genetic Analyzer (Applied 
Biosystems, Foster City, CA) and FS dye-terminator chemistry (Applied Biosystems). 

Sequence analysis: The sequences of AAV 12 rep and cap were compared to sequences 
in GenBank using BLAST. DNA alignments were performed using the ClustalW multiple 
sequence alignment tool of the Biology Workbench web based software at 
http://seqtool.sdsc.edu (SDSC) and MacVector 7 (Accelrys, Burlington, MA). 

Generation of recombinant virus: AAV12 (AAVX26) vectors, expressing a nuclear 
localized green fluorescent protein (GFP), rAAV12GFP, were produced as described above. 
Briefly, 293T cells were cotransfected with pAAV2-NLS-GFP, pAAV12Rep and pAAV12Cap 
and the Ad helper plasmids 449B. Recombinant particles were purified by CsCl gradient 
centrifiigation. DNase-resistant genome copy numbers of the vector stocks were determined by 
quantitative real-time PCR using the TAQMAN system (Applied Biosystems) with probes 
specific to the CMV promoter. AAV12-Epo expressing human erythropoietin was generated 
accordingly by packaging pAAVhEPO with pAAV12Rep and pAAV12Cap. 

Digestion of cell surface sialic acid: Exponentially growing COS cells were plated at a 
density of 5 x 10 3 /well in a flat-bottom 96-well plate. Twenty-four hours after seeding, cells 
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were incubated for 30 min with O.land 1 mU of the broad-spectrum neuraminidase from 
Vibrio ckolerae (Calbiochem, La Jolla, CA). Cells were then washed with medium and 
transduced with 1 x 10 7 particles of rAAV2-GFP, rAAV4-GFP, rAAV5-GFP or rAAV12-GFP. 
GFP expression, which serves as a surrogate marker for transduction, was detected twenty-four 
hours later with a fluorescent cell counter (Guava PCA-96, Guava Technologies, Hayward, 
CA). 

Heparin competition assay: COS cells were plated at a density of 5 x 10 3 /well in a flat- 
bottom 96-welI plate one day prior to transduction. 2 x 10 7 particles rAAV2-GFP, rAAV12- 
GFP were preincubated for 1 h at room temperature in medium supplemented with either 
heparin, mannose or mannosamine (Sigma, St. Louis, MO). This pre-incubation mixture was 
then added and left on the cells for 1 h at 37°C. Cells were then washed with medium and 
incubated for one day before GFP expression was detected with a fluorescent cell counter 
(Guava Technologies). 

Protease treatment: COS cells were cultured in a 1 5- cm diameter culture dish until 
cells were 80% confluent. Cells were then washed twice with PBS, scraped, resuspended in 10 
ml PBS and incubated with 0.05% trypsin (Biosource), or mock (untreated control) for 15 min 
at 37°C. Cells were then washed twice with medium and seeded at a density of 10,000 
cells/well in a 96-well dish. After 1 h culture at 37°C, cells were transduced with 2 x 10 7 
particles rAAV2-GFP, rAAV12-GFP. Transduction efficiency was determined 24 h later by 
GFP expression detection with a fluorescent cell counter (Guava Technologies). 

Inhibition of glycolipid synthesis: COS cells were plated at a density of 5 x 10 3 /well in 
a flat-bottom 96- well plate. Eight hours after seeding, cells were incubated for 40 h with the 
glucosylceramide synthase inhibitors DL^^Arec>-l-Phenyl-2-palmitoylainino-3-morpholino-l-. 
propanol (PPMP) (Sigma, St. Louis, MO). Cells were then washed with medium and 
transduced with 2 xlO 8 particles of rAAV2-GFP, rAAV12-GFP and rBAAV-GFP for 1 h. GFP 
expression was analyzed 48 h after transduction by detection with a fluorescent cell counter 
(Guava Technologies). 

Neutralization assay: COS cells were seeded at a density of 5 x 10 3 /well in a 96-well 
plate 1 day before inoculation. 2 x 10 7 rAAV2-GFP, rAAV6-GFP and rAAV12-GFP particles 
that were pre-incubated with serial dilutions of pooled human IgGs (Immune Globuline 
Intravenous, 10%, Gamunex, BAYER, Leverkusen, Germany) in medium for 1 h at room 
temperature. Cells were then inoculated with this mixture for 1 h at 37°C and then washed 
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with medium. Twenty-four hours after transduction, cells were analyzed for GFP expression by 
flow cytometry (Guava Technologies). 

Animal experiments: Male Balb/c mice were obtained from the Division of Cancer 
Treatment, NCI, Bethesda, MD. Mice were administered 10 9 particles (suspended in 100^1 of 
0.9% NaCl) of either AAV2-hEPO (n=3) or AAV12~hEPO (n=2) by retrograde ductal delivery 
to both submandibular glands (SG). Two additional groups (n=3 and n=2, respectively) 
received an equal dose of 10 9 particles (suspended in 100/xl of 0.9% NaCl) in both their tibialis 
anterior (two injection sites per muscle). A further group (n=3) of naive mice (administered 
with 50/xl of 0.9% NaCl to each SG) was included. Mild anesthesia was induced to all 
participating animals with 1 \i\Jg body weight of a 60 mg/ml ketamine (Phoenix Scientific, St. 
Joseph, MO) and 8 mg/ml xylazine (Phoenix Scientific) solution given intramuscularly. Blood 
samples were obtained by orbital bleeding at distinct time points. Hematocrits (Hcts) were 
determined using micro-hematocrit capillary tubes (Fisher Scientific, Pittsburgh, PA). 
Secretion of hEPO in mouse serum was determined by an ELISA using commercial assay kits 
(R&D systems, MN, USA). The lower limit of detection was 0.6 mU/ml. Assays were 
performed according to the manufacturer's instructions. 

Results 

Identification ofAA V contaminations in ATCC virus isolates: AAV sequences were 
detected in a Simian Adenovirus 18 strain C676 stock, VR-943, which was isolated from a 
Vervet monkey (Cercopithecus aethiops). The entire rep and cap coding region of the AAV 
contamination in VR-943 /termed AAV 12, has been PCR amplified and subcloned (Fig.l). 
Since the rep and cap encoding PCR fragments also contain viral promoter elements, these 
plasmids could be used as packaging constructs for the generation of recombinant virus. 
Vectors based on AAV 12 were produced by cotransfecting an AAV-2 vector plasmid, 
encoding a nuclear localized GFP flanked by AAV2 inverted terminal repeats (ITRs), and 
plasmids encoding AAV 12 rep and cap together with an adenovirus helper plasmid. 
Recombinant viruses were then assayed for transduction activity by inoculating COS cells and 
assaying for GFP expression by flow cytometry. Three clones, which were capable of 
generating recombinant virus were sequenced. 

Phylogenetic analysis: The evolutionary relationship among mammalian AAVs and 
AAV12 was analyzed by ClustalW alignments of genomic, Rep78 and VP1 sequences and 
plotted as a rooted phylogenetic tree (Fig. 1). The DNA sequence of AAV12 showed highest 
homology with AAV1 1 and AAV4, 83% and 8 1 % respectively, whereas lowest similarity was 
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observed with AAV5 (63%). The Rep78 amino acid sequence of AAV12 demonstrated high 
homology to AAV13, AAV4, AAV 10 and AAV1 1 with 89% or 88% identity. For the capsid 
protein VP 1, highest homology was observed with AAV1 1 and AAV4, 84% and 78% 
respectively, whereas AAV5 VP1 displayed lowest similarity with 53%. 

Heparin competition: Heparan sulfate, a cell surface glycosaminoglycan, which is 
expressed by virtually all cells, is an attachment receptor for AAV2 (Summerford, C, et al. 
1998). AAV2 transduction can be inhibited by heparin, a heparan sulfate analog. Thus, 
analysis was conducted to determine whether AAV 12 uses heparan sulfate as a receptor by 
heparin competition experiments. COS cells were transduced with rAAV12 in the presence of 
increasing amounts of heparin. rAAV2 served as a control. rAAV2 transduction was inhibited 
to 75% at a concentration of 12.5|ig/ml heparin, whereas no inhibition of rAAV12 was 
observed at a 80 hold higher heparin concentration of lOOO^g/ml (Fig. 12). Since heparin has 
no inhibitory potential towards AAV 12 transduction, heparan sulfate appears not to be 
involved in AAV 12 transduction. 

Effect of neuraminidase treatment on rAA VI 2 transduction: Sialic acids, a family of 
monosaccharides based on N-acetylneuramic acid, are commonly found on the outermost end 
of glykans and glycolipids. Sialic acids have been identified to serve as receptors for several 
viruses, including influenza virus, AAV4, AAV5 AAV6 and AAV(VR-355). To analyze, if 
AAV 12 transduction is sialic acid dependent, the effect of the removal of cell surface sialic 
acids was studied by neuraminidase treatment on gene transfer (Fig. 134). Enzymatic digestion 
of COS cells with a broad spectrum neuraminidase from Vibrio cholera inhibited rAAV4 and 
rAAV5 transduction dose-dependently and blocked gene transfer up to 99% and 97% 
respectively. AAV 12 transduction was unaffected by the enzymatic removal of cell surface 
sialic acids, indicating that AAV 12 does not utilize sialic acid in the transduction process. 

rAA VI 2 transduction is protease sensitive and does not require glycosphingolipids 
(GSLs): rBAAV cell entry and transduction depend on gangliosides, glycosphingolipids with 
sialic acid groups, and are resistant to protease treatment of the cell. To analyze if the rAAV12 
receptor is a protein and if glycolipids are involved in the transduction process, the effect of 
proteolytically digesting cell surface proteins prior to transduction was studied (Fig.l4A). The 
dependency of rAAV12 transduction on GSLs was studied by incubating cells with DL-rfireo- 
l-Phenyl-2-palmitoylamino-3-morpholino-l-propanol (PPMP), a glucosylceramide synthase 
inhibitors, which act to deplete GSLs from the cell membrane prior to transduction (Fig.l4B). 
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As shown in Figurel4, AAV12 transduction is protease sensitive, but AAV12 can transduce 
cells lacking glycosphingolipids. 

rAA VI 2 is inhibited by extracellular mannosamine: Since rAAV12 transduction does 
not depend on either heparin or sialic acid, a broad panel of carbohydrates was screened for the 
ability to interfere with rAAV12 transduction to identify components that might be involved in 
the rAAV12 cell interaction. In this assay, mannosamine was identified as compound with 
inhibitory function against rAAV12. Preincubation of rAAV12 with mannosamine prior to 
transduction resulted in a dose-dependent inhibition of rAAV12 transduction (Fig.15), while 
no specific inhibition of rAAV2 was observed. This result indicates that mannosamine could 
be part of the r AAV 12 receptor or attachment factor. 

rAA VI 2 has a broad tropism in human cancer cell lines: All studied AAVs utilize 
either sialic acid or heparin as cellular receptor or attachment factor. In contrast, rAAV12 
trasduced cells independently of heparin and sialic acid but was inhibited by mannosamine, 
indicating that rAAV12 interacts in a unique way with the cell. To analyze if unique cell 
interaction results in a unique tropism, the transduction efficiency of rAAV12 was studied in 
1 3 human cancer cell lines and compared it to rAAV4, a virus that shows 78% homology to 
rAAV12 in the capsid protein VP1 (Fig.16). rAAV12 and rAAV4 demonstrated both abroad 
tropism with overall similar transduction efficiencies, but both viruses had a unique 
transduction profile. While only rAAV4 transduced HCT10 cells, rAAV12 gene transfer was 
specific for A549 and HCT15 cells. 

Immunological characterization of rAAV12: Infections with AAV2 are very common 
and approximately 80% of the human population are seropositive. Neutralization of human 
AAV serotypes, such as AAV2, AAV3 and AAV5, as well as the simian AAV6 by human 
serum have been reported. This pre-existing immunity against these AAVs might limit their 
usability as vectors for gene therapy. To test if r AAV 12 is antigentically distinct from rAAV2 
and rAAV6, assays were conducted to determine if IgGs, purified from pooled human serum, 
contain neutralizing antibodies against the recombinant AAVs and whether there was a 
difference in the neutralization activity against either rAAV2, AAV6, or rAAV12 (Fig. 17). 
In this assay the human rAAV2 and simian rAAV6 displayed similar sensitivity to 
neutralization with the purified pooled IgGs and transduction was inhibited to 50% at a 
concentration of O.Olmg/ml IgGs. In contrast, rAAV12 was highly resistant to neutralization 
by pooled human IgGs, and even at the highest concentration of 0.67mg/ml, a concentration 



77 



WO 2006/119432 PCT/US2006/017157 
where 1 00% inhibition of rAAV2 and rAAV6 was observed, rAA V 1 2 transduction was only 

reduced by 30%. 

rAA VI 2 transduces salivary glands and skeletal muscles in vivo: rAAV12 transduced 
salivary glands and skeletal muscles with similar efficiency as rAAV2 (Fig. 18). 
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CLAIMS 

What is claimed is: 

1. A nucleic acid vector comprising an AAV-X1, AAV-Xlb, AAV-X5, AAV-X19, AAV- 
X21, AAV-X22, AAV-X23, AAV-X24, AAV-X25, or AAV-X26-specific nucleic acid 
or a nucleic acid encoding an AAV-X1, AAV-Xlb, AAV-X5, AAV-X19, AAV-X21, 
AAV-X22, AAV-X23, AAV-X24, AAV-X25, or AAV-X26-specific protein. 

2. The nucleic acid vector of claim 1, wherein the nucleic acid encoding an AAV-X1, 
AAV-Xlb, AAV-X5, AAV-X19, AAV-X21, AAV-X22, AAV-X23, AAV-X24, AAV- 
X25, or AAV-X26-specific protein encodes an AAV-X1, AAV-Xlb, AAV-X5, AAV- 
X19, AAV-X21, AAV-X22, AAV-X23, AAV-X24, AAV-X25, or AAV-X26 capsid 
protein or an AAV-X1, AAV-Xlb, AAV-X5, AAV-X19, AAV-X21, AAV-X22, AAV- 
X23, AAV-X24, AAV-X25, or AAV-X26 Rep protein. 

3. The nucleic acid vector of claim 2, wherein the AAV-X1 capsid protein is selected 
from the group consisting of AAV-X1 VP1, AAV-X1 VP2, and AAV-X1 VP3. 

4. The nucleic acid vector of claim 3, wherein the AAV-X1 VP1 capsid protein has the 
amino acid sequence set forth in SEQ ID NO:21. 

5. The nucleic acid vector of claim 2, wherein the AAV-Xlb capsid protein is selected 
from the group consisting of AAV-Xlb VP1, AAV-Xlb VP2, and AAV-Xlb VP3. 

6. The nucleic acid vector of claim 5, wherein the AAV-Xlb VP1 capsid protein has the 
amino acid sequence set forth in SEQ ID NO:22. 

7. The nucleic acid vector of claim 2, wherein the AAV-X5 capsid protein is selected 
from the group consisting of AAV-X5 VP1, AAV-X5 VP2, and AAV-X5 VP3. 

8. The nucleic acid vector of claim 7, wherein the AAV-X5 VP1 capsid protein has the 
amino acid sequence set forth in SEQ ID NO:23. 

9. The nucleic acid vector of claim 2, wherein the AAV-X19 capsid protein is selected 
from the group consisting of AAV-X19 VP1, AAV-X19 VP2, and AAV-X19 VP3. 

10. The nucleic acid vector of claim 9, wherein the AAV-X19 VP1 capsid protein has the 
amino acid sequence set forth in SEQ ID NO:24. 

1 1 . The nucleic acid vector of claim 2, wherein the AAV-X21 capsid protein is selected 
from the group consisting of AAV-X21 VP1 , AAV-X21 VP2, and AAV-X21 VP3. 

12. The nucleic acid vector of claim 1 1, wherein the AAV-X21 VP1 capsid protein has the 
amino acid sequence set forth in SEQ ID NO:25. 

85 



WO 2006/119432 PCT/US2006/0 17157 

13. The nucleic acid vector of claim 2, wherein the AAV-X22 capsid protein is selected 
from the group consisting of AAV-X21 VP1, AAV-X21 VP2, and AAV-X21 VP3. 

14. The nucleic acid vector of claim 11, wherein the AAV-X22 VP1 capsid protein has the 
amino acid sequence set forth in SEQ ID NO:26. 

15. The nucleic acid vector of claim 2, wherein the AAV-X23 capsid protein is selected 
from the group consisting of AAV-X23 VP1, AAV-X23 VP2, and AAV-X23 VP 3. 

16. The nucleic acid vector of claim 15, wherein the AAV-X23 VP1 capsid protein has the 
amino acid sequence set forth in SEQ ID NO:27. 

17. The nucleic acid vector of claim 2, wherein the AAV-X24 capsid protein is selected 
from the group consisting of AAV-X24 VP1, AAV-X24 VP2, and AAV-X24 VP3. 

18. The nucleic acid vector of claim 17, wherein the AAV-X24 VP1 capsid protein has the 
amino acid sequence set forth in SEQ ID NO:28. 

19. The nucleic acid vector of claim 2, wherein the AAV-X25 capsid protein is selected 
from the group consisting of AAV-X25 VP1, AAV-X25 VP2, and AAV-X25 VP 3. 

20. The nucleic acid vector of claim 19, wherein the AAV-X25 VP1 capsid protein has the 
amino acid sequence set forth in SEQ ID NO:29. 

2 1 . The nucleic acid vector of claim 2, wherein the AAV-X26 capsid protein is selected 
from the group consisting of AAV-X26 VP1, AAV-X26 VP2, and AAV-X26 VP3. 

22. The nucleic acid vector of claim 21, wherein the AAV-X26 VP1 capsid protein has the 
amino acid sequence set forth in SEQ ID NO:30. 

23. The nucleic acid vector of claim 2, wherein the AAV-X1, AAV-Xlb, AAV-X5, AAV- 
X19, AAV-X21, AAV-X22, AAV-X23, AAV-X24, AAV-X25, or AAV-X26 Rep 
protein is selected from the group consisting of Rep40, Rep52, Rep68 and Rep78. 

24. The nucleic acid vector of claim 23, wherein the AAV-X1 VP1 capsid protein has the 
amino acid sequence set forth in SEQ ID NO:49. 

25 . The nucleic acid vector of claim 23, wherein the AAV-X25 VP1 capsid protein has the 
amino acid sequence set forth in SEQ ID NO:51. 

26. The nucleic acid vector of claim 23, wherein the AAV-X26 VP1 capsid protein has the 
amino acid sequence set forth in SEQ ID NO:53. 

27. The nucleic acid vector of claim 2, comprising a pair of AAV-X1, AAV-Xlb, AAV- 
X5, AAV-X19, AAV-X21, AAV-X22, AAV-X23, AAV-X24, AAV-X25, or AAV- 
X26 inverted terminal repeats and a promoter between the inverted terminal repeats. 

28. The nucleic acid vector of claim 27, wherein the promoter is an AAV promoter p5. 
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29. The nucleic acid vector of claim 28, wherein the p5 promoter is AAV-X1, AAV-Xlb, 
AAV-X5, AAV-X19, AAV-X21, AAV-X22, AAV-X23, AAV-X24, AAV-X25, or 
AAV-X26 p5 promoter. 

30. The particle of claim 28, wherein the p5 promoter is an AAV1 p5 promoter, an AAV2 
p5 promoter, an AAV3 p5 promoter, an AAV4 p5 promoter, AAV5 p5 promoter, an 
AAV6 p5 promoter, an AAAV p5 promoter, or a BAAV p5 promoter. 

3 1 . The nucleic acid vector of claim 27, further comprising an exogenous nucleic acid 
functionally linked to the promoter. 

32. The nucleic acid vector of claim 27, encapsidated in an adeno-associated virus particle. 

33. The nucleic acid vector of claim 32, wherein the particle is an AAV-X1, AAV-Xlb, 
AAV-X5, AAV-X19, AAV-X21, AAV-X22, AAV-X23, AAV-X24, AAV-X25, or 
AAV-X26 particle. 

34. The nucleic acid vector of claim 32, wherein the particle is an AAV1 particle, an 
AAV2 particle, an AAV3 particle, an AAV4 particle, AAV5 particle, AAV6 particle, 
AAAV particle, or an BAAV particle. 

35. A recombinant AAV-X1, AAV-Xlb, AAV-X5, AAV-X19, AAV-X21, AAV-X22, 
AAV-X23, AAV-X24, AAV-X25, or AAV-X26 virion containing a vector comprising 
a pair of AAV-X1, AAV-Xlb, AAV-X5, AAV-X19, AAV-X21, AAV-X22, AAV- 
X23, AAV-X24, AAV-X25, or AAV-X26 inverted terminal repeats. 

36. The virion of claim 35, wherein the vector further comprises an exogenous nucleic acid 
inserted between the inverted terminal repeats. 

37. The nucleic acid vector of claim 1, wherein the AAV-X1, AAV-Xlb, AAV-X5, AAV- 
XI 9, AAV-X21, AAV-X22, AAV-X23, AAV-X24, AAV-X25, or AAV-X26-specific 
nucleic acid is apair of AAV-X1, AAV-Xlb, AAV-X5, AAV-X19, AAV-X21, AAV- 
X22, AAV-X23, AAV-X24, AAV-X25, or AAV-X26 inverted terminal repeats or an 
AAV-X1, AAV-Xlb, AAV-X5, AAV-X19, AAV-X21, AAV-X22, AAV-X23, AAV- 
X24, AAV-X25, or AAV-X26 p5 promoter. 

38. An isolated nucleic acid comprising the nucleotide sequence set forth in SEQ ID NO: 1 , 
SEQ ID NO:2, SEQ ID NO:3, SEQ ID NO:4, SEQ ID NO:5, SEQ ID NO:6, SEQ ID 
NO:7, SEQ ED NO:8, SEQ ID NO:9, or SEQ ID NO:10. 

39. An isolated nucleic acid consisting essentially of the nucleotide sequence set forth in 
SEQ ID NO:l, SEQ ID NO:2, SEQ ID NO:3, SEQ ID NO:4, SEQ ID NO:5, SEQ ID 
NO:6, SEQ ID NO:7, SEQ ID NO:8, SEQ ID NO:9, or SEQ ID NO: 10. 
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40. An isolated nucleic acid that selectively hybridizes with the nucleic acid of claim 39. 

41. An isolated nucleic acid encoding an AAV-X1 , AAV-Xlb, AAV-X5, AAV-X1 9, 
AAV-X21, AAV-X22, AAV-X23, AAV-X24, AAV-X25, or AAV-X26 capsid protein. 

42. The isolated nucleic acid of claim 41, comprising the nucleotide sequence set forth in 
SEQ ID NO:ll, SEQ ID NO:12, SEQ ID N0.13, SEQ ID NO:14, SEQ ID NO:15, 
SEQ ID NO: 16, SEQ ID NO:17, SEQ ID NO:18, SEQ ID NO:19, or SEQ ED NO:20. 

43. The isolated nucleic acid of claim 41, wherein the AAV-X1, AAV-Xlb, AAV-X5, 
AAV-X19, AAV-X21, AAV-X22, AAV-X23, AAV-X24, AAV-X25, or AAV-X26 
capsid protein has the amino acid sequence set forth in SEQ ID NO:21, SEQ ID NO:22, 
SEQ ID NO:23, SEQ ED NO:24, SEQ ED NO:25, SEQ ID NO:26, SEQ ID NO:27, 
SEQ ID NO:28, SEQ ID NO:29, or SEQ ID NO:30. 

44. An isolated AAV-X1, AAV-Xlb, AAV-X5, AAV-X19, AAV-X21, AAV-X22, AAV- 
X23, AAV-X24, AAV-X25, or AAV-X26 capsid protein. 

45. The isolated AAV-X1, AAV-Xlb, AAV-X5, AAV-X19, AAV-X21, AAV-X22, AAV- 
X23, AAV-X24, AAV-X25, or AAV-X26 capsid protein of claim 44, having the amino 
acid sequence set forth in SEQ ID NO:21, SEQ ED NO:22, SEQ ID NO:23, SEQ ID 
NO:24, SEQ ID NO:25, SEQ ID NO:26, SEQ ID NO:27, SEQ ID NO:28, SEQ ED 
NO:29, or SEQ ED NO:30, or a unique fragment thereof. 

46. The isolated AAV-X1, AAV-Xlb, AAV-X19, AAV-X21, AAV-X22, AAV-X23, 
AAV-X24, or AAV-X25 capsid protein of claim 44, having at least a 98% sequence 
identity to the amino acid sequence set forth in SEQ ED NO:21, SEQ ID NO:22, SEQ 
ED NO:24, SEQ ID NO:25, SEQ ID NO:26, SEQ ID NO:27, SEQ ED NO:28, or SEQ 
IDNO:29. 

47. The isolated AAV-X5 capsid protein of claim 44, having at least an 93% sequence 
identity to the amino acid sequence set forth in SEQ ED NO:23 or SEQ ID NO:30. 

48. The isolated AAV-X26 capsid protein of claim 44, having at least an 83% sequence 
identity to the amino acid sequence set forth in SEQ ID NO:23 or SEQ ID NO:30. 

49. An isolated antibody that specifically binds the protein of claim 44. 

50. An AAV-X1, AAV-Xlb, AAV-X5, AAV-X19, AAV-X21, AAV-X22, AAV-X23, 
AAV-X24, AAV-X25, or AAV-X26 particle comprising a capsid protein consisting 
essentially of the amino acid sequence set forth in SEQ ID NO:21, SEQ ID NO:22, 
SEQ ID NO:23, SEQ ID NO:24, SEQ ED NO:25, SEQ ID NO:26, SEQ ID NO:27, 
SEQ ID NO:28, SEQ ID NO:29, or SEQ ID NO:30. 
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A method of screening a cell for infectivity by AAV-X1 , AAV-Xlb, AAV-X5, AAV- 
X19, AAV-X21, AAV-X22, AAV-X23, AAV-X24, AAV-X25, or AAV-X26, 
comprising contacting the cell with AAV-X1, AAV-Xlb, AAV-X5, AAV-X19, AAV- 
X21, AAV-X22, AAV-X23, AAV-X24, AAV-X25, or AAV-X26 and detecting the 
presence of AAV-X1, AAV-Xlb, AAV-X5, AAV-X19, AAV-X21, AAV-X22, AAV- 
X23, AAV-X24, AAV-X25, or AAV-X26 in the cells. 

A method of determining the suitability of an AAV-X1, AAV-Xlb, AAV-X5, AAV- 
X19, AAV-X21, AAV-X22, AAV-X23, AAV-X24, AAV-X25, or AAV-X26 vector 
for administration to a subject, comprising contacting an antibody-containing sample 
from the subject with an antigenic fragment of a protein of claim 44 and detecting an 
antibody-antigen reaction in the sample, the presence of a neutralizing reaction 
indicating the AAV-X1, AAV-Xlb, AAV-X5, AAV-X19, AAV-X21, AAV-X22, 
AAV-X23, AAV-X24, AAV-X25, or AAV-X26 vector to be unsuitable for use in the 
subject. 

A method of determining the presence in a subject of an AAV-X1, AAV-Xlb, AAV- 
X5, AAV-X19, AAV-X21, AAV-X22, AAV-X23, AAV-X24, AAV-X25, or AAV- 
X26 -specific antibody comprising, contacting an antibody-containing sample from the 
subject with an antigenic fragment of the protein of claim 44 and detecting an antibody- 
antigen reaction in the sample, the presence of a reaction indicating the presence of an 
AAV-X1, AAV-Xlb, AAV-X5, AAV-X19, AAV-X21, AAV-X22, AAV-X23, AAV- 
X24, AAV-X25, or AAV-X26 -specific antibody in the subject. 
A method of delivering a nucleic acid to a cell, comprising administering to the cell an 
AAV-X1, AAV-Xlb, AAV-X5, AAV-X19, AAV-X21, AAV-X22, AAV-X23, AAV- 
X24, AAV-X25, or AAV-X26 particle containing a vector comprising the nucleic acid 
inserted between a pair of AAV inverted terminal repeats, thereby delivering the 
nucleic acid to the cell. 

The method of claim 54, wherein the AAV inverted terminal repeats are AAV-X1, 
AAV-Xlb, AAV-X5, AAV-X19, AAV-X21, AAV-X22, AAV-X23, AAV-X24, AAV- 
X25, or AAV-X26 inverted terminal repeats. 

A method of delivering a nucleic acid to a subject comprising administering to a cell 
from the subject an AAV-X1, AAV-Xlb, AAV-X5, AAV-X19, AAV-X21, AAV-X22, 
AAV-X23, AAV-X24, AAV-X25, or AAV-X26 particle comprising the nucleic acid 
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inserted between a pair of AAV inverted terminal repeats, and returning the cell to the 
subject, thereby delivering the nucleic acid to the subject. 

57. A method of delivering a nucleic acid to a cell in a subject comprising administering to 
the subject an AAV-X1, AAV-Xlb, AAV-X5, AAV-X19, AAV-X21, AAV-X22, 
AAV-X23, AAV-X24, AAV-X25, or AAV-X26 particle comprising the nucleic acid 
inserted between a pair of AAV inverted terminal repeats, thereby delivering the 
nucleic acid to a cell in the subject. 

58. A method of delivering a nucleic acid to a cell in a subject having antibodies to AAV2 
comprising administering to the subject an AAV-X1, AAV-Xlb, AAV-X5, AAV-X19, 
AAV-X21, AAV-X22, AAV-X23, AAV-X24, AAV-X25, or AAV-X26 particle 
comprising the nucleic acid, thereby delivering the nucleic acid to a cell in the subject. 

59. A vector system for producing infectious virus particles having a characteristic of 
AAV-X1, AAV-Xlb, AAV-X5, AAV-X19, AAV-X21, AAV-X22, AAV-X23, AAV- 
X24, AAV-X25, or AAV-X26 comprising: at least one vector comprising a nucleic 
acid selected from the group consisting of a pair of AAV-X1, AAV-Xlb, AAV-X5, 
AAV-X19, AAV-X21, AAV-X22, AAV-X23, AAV-X24, AAV-X25, or AAV-X26 
inverted terminal repeats, a nucleic acid encoding an AAV-X1, AAV-Xlb, AAV-X5, 
AAV-X19, AAV-X21, AAV-X22, AAV-X23, AAV-X24, AAV-X25, or AAV-X26 
capsid protein, and a nucleic acid encoding an AAV-X1, AAV-Xlb, AAV-X5, AAV- 
X19, AAV-X21, AAV-X22, AAV-X23, AAV-X24, AAV-X25, or AAV-X26 Rep 
protein. 

60. The vector system of claim 59, comprising two vectors. 

6 1 . The vector system of claim 60, wherein the first vector comprises a nucleic acid 
encoding an AAV-X1, AAV-Xlb, AAV-X5, AAV-X19, AAV-X21, AAV-X22, AAV- 
X23, AAV-X24, AAV-X25, or AAV-X26 Rep protein and the second vector comprises 
a pair of AAV-X1, AAV-Xlb, AAV-X5, AAV-X19, AAV-X21, AAV-X22, AAV- 
X23, AAV-X24, AAV-X25, or AAV-X26 inverted terminal repeats. 

62. The vector system of claim 60, wherein the first vector comprises a nucleic acid 
encoding an AAV-X1, AAV-Xlb, AAV-X5, AAV-X19, AAV-X21, AAV-X22, AAV- 
X23, AAV-X24, AAV-X25, or AAV-X26 capsid protein and a nucleic acid encoding 
an AAV-X1, AAV-Xlb, AAV-X5, AAV-X19, AAV-X21, AAV-X22, AAV-X23, 
AAV-X24, AAV-X25, or AAV-X26 Rep protein and the second vector comprises a 
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pair of AAV-X1, AAV-Xlb, AAV-X5, AAV-X19, AAV-X21, AAV-X22, AAV-X23, 
AAV-X24, AAV-X25, or AAV-X26 inverted terminal repeats. 

63. The vector system of claim 60, wherein the first vector comprises a nucleic acid 
encoding an AAV-X1, AAV-Xlb, AAV-X5, AAV-X19, AAV-X21, AAV-X22, AAV- 
X23, AAV-X24, AAV-X25, or AAV-X26 capsid protein and the second vector 
comprises a pair of AAV inverted terminal repeats. 

64. The vector system of claim 63, wherein the second vector comprises a pair of AAV1 
inverted terminal repeats. 

65. The vector system of claim 63, wherein the second vector comprises a pair of AAV2 
inverted terminal repeats. 

66. The vector system of claim 63, wherein the second vector comprises a pair of AAV3 
inverted terminal repeats. 

67. The vector system of claim 63, wherein the second vector comprises a pair of AAV4 
inverted terminal repeats. 

68. The vector system of claim 63, wherein the second vector comprises a pair of AAV5 
inverted terminal repeats. 

69. The vector system of claim 63, wherein the second vector comprises a pair of AAV6 
inverted terminal repeats. 

70. The vector system of claim 63, wherein the second vector comprises a pair of AAV7 
inverted terminal repeats. 

7 1 . The vector system of claim 63, wherein the second vector comprises a pair of AAV8 
inverted terminal repeats. 

72. The vector system of claim 63, wherein the second vector comprises a pair of AAV9 
inverted terminal repeats. 

73. The vector system of claim 63, wherein the second vector comprises a pair of AAV10 
inverted terminal repeats. 

74. The vector system of claim 63, wherein the second vector comprises a pair of AAV1 1 
inverted terminal repeats. 

75. The vector system of claim 63, wherein the second vector comprises a pair of AAAV 
inverted terminal repeats. 

76. The vector system of claim 63, wherein the second vector comprises a pair of BAAV 
inverted terminal repeats. 
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77. The vector system of claim 63, wherein the second vector comprises a pair of AAV-X1 , 
AAV-Xlb, AAV-X5, AAV-X19, AAV-X21, AAV-X22, AAV-X23, AAV-X24, AAV- 
X25, or AAV-X26 inverted terminal repeats. 

78. The vector system of claim 63, wherein the first vector further comprises a nucleic acid 
encoding an AAV1 Rep protein. 

79. The vector system of claim 63, wherein the first vector further comprises a nucleic acid 
encoding an AAV2 Rep protein. 

80. The vector system of claim 63, wherein the first vector further comprises a nucleic acid 
encoding an AAV3 Rep protein. 

81. The vector system of claim 63, wherein the first vector further comprises a nucleic acid 
encoding an AAV4 Rep protein. 

82. The vector system of claim 63, wherein the first vector further comprises a nucleic acid 
encoding an AAV5 Rep protein. 

83. The vector system of claim 63, wherein the first vector further comprises a nucleic acid 
encoding an AAV6 Rep protein. 

84. The vector system of claim 63, wherein the first vector further comprises a nucleic acid 
encoding an AAV7 Rep protein. 

85. The vector system of claim 63, wherein the first vector further comprises a nucleic acid 
encoding an AAV8 Rep protein. 

86. The vector system of claim 63, wherein the first vector further comprises a nucleic acid 
encoding an AAV9 Rep protein. 

87. The vector system of claim 63, wherein the first vector further comprises a nucleic acid 
encoding an AAV 10 Rep protein. 

88. The vector system of claim 63, wherein the first vector further comprises a nucleic acid 
encoding an AAV 1 1 Rep protein. 

89. The vector system of claim 63, wherein the first vector further comprises a nucleic acid 
encoding an AAAV Rep protein. 

90. The vector system of claim 63, wherein the first vector further comprises a nucleic acid 
encoding a BAAV Rep protein. 

91. The vector system of claim 63, wherein the first vector further comprises a nucleic acid 
encoding an AAV-X1, AAV-Xlb, AAV-X5, AAV-X19, AAV-X21, AAV-X22, AAV- 
X23, AAV-X24, AAV-X25, or AAV-X26 Rep protein. 
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92. The vector system of claim 60, wherein the first vector comprises a nucleic acid 
encoding an AAV capsid protein and the second vector comprises a pair of AAV-X1, 
AAV-Xlb, AAV-X5, AAV-X19, AAV-X21, AAV-X22, AAV-X23, AAV-X24, AAV- 
X25, or AAV-X26 inverted terminal repeats. 

93. The vector system of claim 92, wherein the first vector comprises a nucleic acid 
encoding an AAV1 capsid protein. 

94. The vector system of claim 92, wherein the first vector comprises a nucleic acid 
encoding an AAV2 capsid protein. 

95. The vector system of claim 92, wherein the first vector comprises a nucleic acid 
encoding an AAV3 capsid protein. 

96. The vector system of claim 92, wherein the first vector comprises a nucleic acid 
encoding an AAV4 capsid protein. 

97. The vector system of claim 92, wherein the first vector comprises a nucleic acid 
encoding an AAV5 capsid protein. 

98. The vector system of claim 92, wherein the first vector comprises a nucleic acid 
encoding an AAV6 capsid protein. 

99. The vector system of claim 92, wherein the first vector comprises a nucleic acid 
encoding an AAV7 capsid protein. 

1 00. The vector system of claim 92, wherein the first vector comprises a nucleic acid 
encoding an AAV8 capsid protein. 

101. The vector system of claim 92, wherein the first vector comprises a nucleic acid 
encoding an AAV9 capsid protein. 

102. The vector system of claim 92, wherein the first vector comprises a nucleic acid 
encoding an AAV 10 capsid protein. 

103. The vector system of claim 92, wherein the first vector comprises a nucleic acid 
encoding an AAV1 1 capsid protein. 

104. The vector system of claim 92, wherein the first vector comprises a nucleic acid 
encoding an AAAV capsid protein. 

105. The vector system of claim 92, wherein the first vector comprises a nucleic acid 
encoding a BAAV capsid protein. 

106. The vector system of claim 92, wherein the first vector comprises a nucleic acid 
encoding an AAV-X1, AAV-Xlb, AAV-X5, AAV-X19, AAV-X21, AAV-X22, AAV- 
X23, AAV-X24, AAV-X25, or AAV-X26 capsid protein. 
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1 07. The vector system of any of claims 92 to 1 05, wherein the first vector further comprises 
a nucleic acid encoding an AAV-X1, AAV-Xlb, AAV-X5, AAV-X19, AAV-X21, 
AAV-X22, AAV-X23, AAV-X24, AAV-X25, or AAV-X26 Rep protein. 

1 08. The vector system of any of claims 92 to 1 05 , wherein the second vector further 
comprises a promoter between the inverted terminal repeats. 

1 09 . The vector system of claim 1 08, wherein the promoter is functionally linked to an 
exogenous nucleic acid. 

1 10. The vector system of any of claims to 59 to 1 05, wherein a vector of the vector system 
is encapsidated in an AAV particle. 

111. The vector system of claim 1 10, wherein the vector is encapsidated in a dependent 
parvovirus particle. 

112. The vector system of claim 111, wherein the parvovirus particle which encapsidates the 
vector is an AAV1 particle, an AAV2 particle, an AAV3 particle, an AAV4 particle, an 
AAV5 particle, an AAV6 particle, an AAV7 particle, an AAV8 particle, an AAV9 
particle, an AAV10 particle, an AAV1 1 particle, an AAAV particle, a BAAV particle, 
an AAV-X1 particle, an AAV-Xlb particle, an AAV-X5 particle, an AAV-X19 
particle, an AAV-X21 particle, an AAV-X23 particle, an AAV-X24 particle, an AAV- 
X25 particle, or an AAV-X26 particle. 

113. A vector comprising a pair of AAV inverted terminal repeats, a nucleic acid encoding 
an AAV-X1, AAV-Xlb, AAV-X5, AAV-X19, AAV-X21, AAV-X22, AAV-X23, 
AAV-X24, AAV-X25, or AAV-X26 capsid protein and a nucleic acid encoding an 
AAV Rep protein. 
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SEQUENCE LISTING 

<110> Government of the United States of America, as represented by the 
Secretary, Department of Health and Human Services 

<120> isolation, cloning and characterization 
of New Adeno-Associated Virus (AAV) serotypes 



<130> 14014. 0433P1 

<150> 60/676,604 
<151> 2005-04-29 

<160> 54 

<170> FastSEQ for Windows Version 4.0 

<210> 1 
<211> 4259 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Note = 
synthethic construct 

<221> misc_feature 
<222> 6 

<223> N can be A, C, G, or T 
<400> 1 

tttgcgacat tttgcgacac cacgtggcca 
aggatcccca ttttgaccgc gaaatttgaa 
atcgtgatca aggtgccgag cgacctggac 
gtgaactggg tggccgagaa ggaatgggag 
ctgattgagc aggcacccct gaccgtggcc 
tggcgccgcg tgagtaaggc cccggaggcc 
tcctacttcc acctccatat tctggtggag 
cgcttcttga gtcagattag ggacaagctg 
accctgccca actggttcgc ggtgaccaag 
gtggtggacg agtgctacat ccccaactac 
tgggcgtgga ctaacatgga ggagtatata 
cggctcgtgg cgcagcacct gacccacgtc 
ctgaacccca attctgacgc gcctgtcatc 
ctggtcgggt ggctggtgga ccggggcatc 
caggcctcgt acatctcctt caacgccgcc 
ctggacaatg ccggcaagat catggcgctg 
cccgctcctc ccgcggacat taaaaccaac 
tacgaccctg cctacgccgg ctccgtcttt 
cgcaacacca tctggctgtt tgggccggcc 
atcgcccacg ccgtgccctt ctacggctgc 
aacgattgcg tcgacaagat ggtgatctgg 
gtggagtccg ccaaggccat tctcggcggc 
tcgtccgccc agatcgatcc cacccccgtg 
gtgattgacg ggaacagcac caccttcgag 
aaatttgaac tcacccgccg tctggagcat 
aaagagttct tccgctgggc gcaggatcac 
agaaagggtg gagccaacaa aagacccgcc 
cgggcctgcc cctcagtcgc ggatccatcg 
tttgccgaca ggtaccaaaa caaatgttct 
ccctgcaaga catgcgagag aatgaatcag 



ttcagggtat atatggccga 
cgagcagcag ccatgccggg 
gagcacctgc cgggcatttc 
ctgcccccgg attctgacat 
gagaagctgc agcgcgactt 
ctcttctttg ttcagttcga 
accacggggg tcaaatccat 
gtgcagacca tctaccgcgg 
acgcgtaatg gcgccggagg 
ctcctgccca agactcagcc 
agcgcgtgtt tgaacctggc 
agccagaccc aggagcagaa 
cggtcaaaaa cctccgcgcg 
acctccgaga agcagtggat 
tccaactcgc ggtcccagat 
accaaatccg cgcccgacta 
cgcatctacc gcatcctgga 
ctcggctggg cccagaaacg 
accacgggca agaccaacat 
gtcaactgga ccaatgagaa 
tgggaggagg gcaagatgac 
agcaaggtgc gcgtggacca 
atcgtcacct ccaacaccaa 
caccagcagc cgttgcagga 
gactttggca aggtgacaaa 
gtgaccgagg tggcgcatga 
cccgatgacg cggataaaag 
acgtcagacg cggaaggagc 
cgtcacgcgg gcatgcttca 
aatttcaaca tttgcttcac 
Page 1 



gtgagcgagc 60 

cttctacgag 120 

tgactcgttt 180 

ggatctgaat 240 

cctggtccaa 300 

gaagggcgag 360 

ggtgctgggc 420 

gatcgagccg 480 

ggggaacaag 540 

cgagctgcag 600 

cgagcgcaaa 660 

caaggagaat 720 

ctacatggag 780 

ccaggaggac 840 

caaggccgct 900 

cctggtaggc 960 

gctgaacggc 1020 

gttcgggaag 1080 

cgcggaagcc 1140 

ctttcccttc 1200 

ggccaaggtc 1260 

aaagtgcaag 1320 

catgtgcgcc 1380 

ccggatgttc 1440 

gcaggaagtc 1500 

gttctacgtc 1560 

cgagcccaag 1620 

tccggtggac 1680 

gatgctgttt 1740 

gcacgggacc 1800 
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agagactgtt cagaatgttt ccccggcgtg tcagaatctc aaccggtcgt cagaaaaagg 1860 

acgtatcgga aactctgtgc gattcatcat ctgctggggc gggctcccga gattgcttgc 1920 

tcggcctgcg atctggtcaa cgtggacctg gatgactgtg tctctgagca ataaatgact 1980 

taaaccaggt atggctgccg atggttatct tccagattgg ctcgaggaca acctctctga 2040 

gggcattcgc gagtggtggg acttgaaacc tggagccccg aaacccaaag ccaaccagca 2100 

aaagcaggac gacggccggg gtctggtgct tcctggctac aagtacctcg gacccttcaa 2160 

cggactcgac aagggggagc ccgtcaacgc ggcggacgca gcggccctcg agcacgacaa 2220 

ggcctacgac cagcagctca aagcgggtga caatccgtac ctgcggtata accacgccga 2280 

cgccgagttt caggagcgtc tgcaagaaga tacgtctttt gggggcaacc tcgggcgagc 2340 

agtcttccag gccaagaagc gggttctcga accttttggt ctggttgagg aaggcgctaa 2400 

gacggctcct ggaaagaaac gtccggtaga gcagtcgcca caagagccag actcctcctc 2460 

gggcatcggc aagacaggcc agcagcccgc taaaaagaga ctcaattttg gtcagactgg 2520 

cgactcagag tcagtccccg acccacaacc tctcggagaa cctccagcaa cccccgctgc 2 580 

tgtgggacct actacaatgg cttcaggcgg tggcgcacca atggcagaca ataacgaagg 2640 

cgccgacgga gtgggtaatg cctcaggaaa ttggcattgc gattccacat ggctgggcga 2700 

cagagtcatc accaccagca cccgcacctg ggccttgccc acctacaata accacctcta 2760 

caagcaaatc tccagtgctt caacgggggc cagcaacgac aaccactact tcggctacag 2820 

caccccctgg gggtattttg atttcaacag attccactgc cacttttcac cgcgtgactg 2880 

gcagcgactc atcaacaaca attggggatt ccggcccaag agactcaact tcaagctctt 2940 

caacatccaa gtcaaggagg tcacgacgag tgatggcgtc acaaccatcg ctaataacct 3000 

taccagcacg gttcaagtct tctcggactc ggagtaccag cttccgtacg tcctcggctc 3060 

tgcgcaccag ggctgcctcc ctccgttccc ggcggacgtg ttcatgattc cgcaatacgg 3120 

ctacctgacg ctcaacaatg gcagccaagc cgtgggacgt tcatcctttt actgcctgga 3180 

atatttccct tctcagatgc tgagaacggg caacaacttt accttcagct acacctttga 3240 

ggaagtgcct ttccacagca gctacgcgca cagccagagc ctggaccggc tgatgaatcc 3300 

tctcatcgac caatacctgt attacctgaa cagaactcaa aatcagtccg gaagtgccca 3360 

aaacaaggac ttgctgttta gccgtgggtc tccagctggc atgtctgttc agcccaaaaa 3420 

ctggctacct ggaccctgtt atcggcagca gcgcgtttct aaaacaaaaa caggcaacaa 3480 

caacagcaat tttacctgga ctggtgcttc aaaatataac ctcaatgggc atgaatccat 3540 

catcaaccct ggcactgcta tggcctcaca caaagacgac gaagacaagt tctttcccat 3600 

gagcggtgtc atgatttttg gaaaagagag cgccggagct tcaaacactg cattggacaa 3660 

tgtcatgatt acagacgaag aggaaattaa agccactaac cctgtggcca ccgaaagatt 3720 

tgggaccgtg gcagtcaatt tccagagcag cagcacacac cctgcgaccg gagatgtgca 3780 

tgttatggga gcattacctg gcatggtgtg gcaagataga gacgtgtacc tgcagggtcc 3840 

catctgggcc aaaattcctc acacagatgg acactttcac ccgtctcctc ttatgggcgg 3900 

ctttggactc aagcacccgc ctcctcagat cctcatcaaa aacacgcctg ttcctgcgaa 3960 

tcctccggcg gagttttcag ctacaaagtt tgcttcattc atcacccaat actccacagg 4020 

acaagtgagt gtggaaattg aatgggagct gcagaaagaa aacagcaagc gctggaatcc 4080 

cgaagtgcag tacacatcca attatgcaaa atctgccaac gttgatttta ccgtggacaa 4140 

caatggactt tatactgagc ctcgccccat tggcacccgt taccttaccc gtcccctgta 4200 

attacgtgtt aatcaataaa ccggttgatt cgtttcagtt gaactttggt ctcctgtcc 4259 

<210> 2 
<211> 4305 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Note = 
synthethic construct 

<400> 2 

accctcacta aagggactag tcctgcaggt ttaaacgaat tcgccctttg cgacattttg 60 

cgacaccacg tggccattca gggtatatat ggccgagtga gcgagcagga tccccatttt 120 

gaccgcgaaa tttgaacgag cagcagccat gccgggcttc tacgagatcg tgatcaaggt 180 

gccgagcgac ctggacgagc acctgccggg catttctgac tcgtttgtga actgggtggc 240 

cgagaaggaa tgggagctgc ccccggattc tgacatggat ctgaatctga ttgagcaggc 300 

acccctgacc gtggccgaga agctgcagcg cgacttcctg gtccaatggc gccgcgtgag 360 

taaggccccg gaggccctct tctttgttca gttcgagaag ggcgagtcct acttccacct 420 

ccatattctg gtggagacca cgggggtcaa atccatggtg ctgggccgct tcttgagtca 480 

gattagggac aagctggtgc agaccatcta ccgcgggatc gagccgaccc tgcccaactg 540 

gttcgcggtg accaagacgc gtaatggcgc cggagggggg aacaaggtgg tggacgagtg 600 

ctacatcccc aactacctcc tgcccaagac tcagcccgag ctgcagtggg cgtggactaa 660 
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catggaggag tatataagcg cgtgtttgaa cctggccgag cgcaaacggc tcgtggcgca 720 

gcacctgacc cacgtcagcc agacccagga gcagaacaag gagaatctga accccaattc 780 

tgacgcgcct gtcatccggt caaaaacctc cgcgcgctac atggagctgg tcgggtggct 840 

ggtggaccgg ggcatcacct ccgagaagca gtggatccag gaggaccagg cctcgtacat 900 

ctccttcaac gccgcctcca actcgcggtc ccagatcaag gccgctctgg acaatgccgg 960 

caagatcatg gcgctgacca aatccgcgcc cgactacctg gtaggccccg ctcctcccgc 1020 

ggacattaaa accaaccgca tctaccgcat cctggagctg aacggctacg accctgccta 1080 

cgccggctcc gtctttctcg gctgggccca gaaacggttc gggaagcgca acaccatctg 1140 

gctgtttggg ccggccacca cgggcaagac caacatcgcg gaagccatcg cccacgccgt 1200 

gcccttctac ggctgcgtca actggaccaa tgagaacttt cccttcaacg attgcgtcga 1260 

caagatggtg atctggtggg aggagggcaa gatgacggcc aaggtcgtgg agtccgccaa 1320 

ggccattctc ggcggcagca aggtgcgcgt ggaccaaaag tgcaagtcgt ccgcccagat 1380 

cgatcccacc cccgtgatcg tcacctccaa caccaacatg tgcgccgtga ttgacgggaa 1440 

cagcaccacc ttcgagcacc agcagccgtt gcaggaccgg atgttcaaat ttgaactcac 1500 

ccgccgtctg gagcatgact ttggcaaggt gacaaagcag gaagtcaaag agttcttccg 1560 

ctgggcgcag gatcacgtga ccgaggtggc gcatgagttc tacgtcagaa agggtggagc 1620 

caacaaaaga cccgcccccg atgacgcgga taaaagcgag cccaagcggg cctgcccctc 1680 

agtcgcggat ccatcgacgt cagacgcgga aggagctccg gtggactttg ccgacaggta 1740 

ccaaaacaaa tgttctcgtc acgcgggcat gcttcagatg ctgtttccct gcaagacatg 1800 

cgagagaatg aatcagaatt tcaacatttg cttcacgcac gggaccagag actgttcaga 1860 

atgtttcccc ggcgtgtcag aatctcaacc ggtcgtcaga aaaaggacgt atcggaaact 1920 

ctgtgcgatt catcatctgc tggggcgggc tcccgagatt gcttgctcgg cctgcgatct 1980 

ggtcaacgtg gacctggatg actgtgtctc tgagcaataa atgacttaaa ccaggtatgg 2040 

ctgccgatgg ttatcttcca gattggctcg aggacaacct ctctgagggc attcgcgagt 2100 

ggtgggactt gaaacctgga gccccgaaac ccaaagccaa ccagcaaaag caggacgacg 2160 

gccggggtct ggtgcttcct ggctacaagt acctcggacc cttcaacgga ctcgacaagg 2220 

gggagcccgt caacgcggcg gacgcagcgg ccctcgagca cgacaaggcc tacgaccagc 2280 

agctcaaagc gggtgacaat ccgtacctgc ggtataacca cgccgacgcc gagtttcagg 2340 

agcgtctgca agaagatacg tcttttgggg gcaacctcgg gcgagcagtc ttccaggcca 2400. 

agaagcgggt tctcgaacct tttggtctgg ttgaggaagg cgctaagacg gctcctggaa 2460 

agaaacgtcc ggtagagcag tcgccacaag agccagactc ctcctcgggc atcggcaaga 2520 

caggccagca gcccgctaaa aagagactca attttggtca gactggcgac tcagagtcag 2580 

tccccgaccc acaacctctc ggagaacctc cagcaacccc cgctgctgtg ggacctacta 2640 

caatggcttc aggcggtggc gcaccaatgg cagacaataa cgaaggcgcc gacggagtgg 2700 

gtaatgcctc aggaaattgg cattgcgatt ccacatggct gggcgacaga gtcatcacca 2760 

ccagcacccg cacctgggcc ttgcccacct acaataacca cctctacaag caaatctcca 2820 

gtgcttcaac gggggccagc aacgacaacc actacttcgg ctacagcacc ccctgggggt 2880 

attttgattt caacagattc cactgccact tttcaccgcg tgactggcag cgactcatca 2940 

acaacaattg gggattccgg cccaagagac tcaacttcaa gctcttcaac atccaagtca 3000 

aggaggtcac gacgagtgat ggcgtcacaa ccatcgctaa taaccttacc agcacggttc 3060 

aagtcttctc ggactcggag taccagcttc cgtacgtcct cggctctgcg caccagggct 3120 

gcctccctcc gttcccggcg gacgtgttca tgattccgca atacggctac ctgacgctca 3180 

acaatggcag ccaagccgtg ggacgttcat ccttttactg cctggaatat ttcccttctc 3240 

agatgctgag aacgggcaac aactttacct tcagctacac ctttgaggaa gtgcctttcc 3300 

acagcagcta cgcgcacagc cagagcctgg accggctgat gaatcctctc atcgaccaat 3360 

acctgtatta cctgaacaga actcaaaatc agtccggaag tgcccaaaac aaggacttgc 3420 

tgtttagccg tgggtctcca gctggcatgt ctgttcagcc caaaaactgg ctacctggac 3480 

cctgttatcg gcagcagcgc gtttctaaaa caaaaacagg caacaacaac agcaatttta 3540 

cctggactgg tgcttcaaaa tataacctca atgggcatga atccatcatc aaccctggca 3600 

ctgctatggc ctcacacaaa gacgacgaag acaagttctt tcccatgagc ggtgtcatga 3660 

tttttggaaa agagagcgcc ggagcttcaa acactgcatt ggacaatgtc atgattacag 3720 

acgaagagga aattaaagcc actaaccctg tggccaccga aagatttggg accgtggcag 3780 

tcaatttcca gagcagcagc acacaccctg cgaccggaga tgtgcatgtt atgggagcat 3840 

tacctggcat ggtgtggcaa gatagagacg tgtacctgca gggtcccatc tgggccaaaa 3900 

ttcctcacac agatggacac tttcacccgt ctcctcttat gggcggcttt ggactcaagc 3960 

acccgcctcc tcagatcctc atcaaaaaca cgcctgttcc tgcgaatcct ccggcggagt 4020 

tttcagctac aaagtttgct tcattcatca cccaatactc cacaggacaa gtgagtgtgg 4080 

aaattgaatg ggagctgcag aaagaaaaca gcaagcgctg gaatcccgaa gtgcagtaca 4140 

catccaatta tgcaaaatct gccaacgttg attttaccgt ggacaacaat ggactttata 4200 

ctgagcctcg ccccattggc acccgttacc ttacccgtcc cctgtaatta cgtgttaatc 4260 

aataaaccgg ttgattcgtt tcagttgaac tttggtctcc tgtcc 4305 

<210> 3 
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<212> DNA 

<213> Artificial sequence 
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<220> 

<223> Description of Artificial sequence: 
synthethic construct 

<221> misc_feature 

<222> 745, 826, 845, 932, 960, 1003 

<223> N can be A, C, G, or T 



Note = 



<400> 3 

ccgcgagtga 

tgccgggatt 

gcatttctga 

ctgacatgga 

gcgaattcct 

agttcgagaa 

aatccatggt 

accgcggggt 

ccggaggcgg 

cccagcccga 

atctcgcgga 

agcagaacaa 

gcgaggtaca 

tggantccag 

acaaatcaag 

ggactacctg 

cctcgagatg 

aaagaagttc 

caacatcgct 

tgagaacttt 

gatgacggcc 

ggaccaaaag 

caccaacatg 

ccaagaccgg 

caccaagcag 

tcacgagttt 

tataagtgag 

agctccggtg 

tctgatgctt 

cacgcacggg 

tgtcgtcaga 

gcccgaggtg 

ggagcaataa 

acaacctctc 

aggcaaatca 

tcggacccgg 

tcgaacacga 

acaaccacgc 

acctcggacg 

aggaagcggc 

cggactcctc 

ttggtcagac 

cagccccctc 

acaataacga 

aatggctggg 

acaatcacct 

ttggctacag 

cacgtgactg 

tcaagctctt 

ccaataacct 



gcgaaccagg 
ctacgagatt 
ctcttttgta 
tctgaatctg 
ggtcgagtgg 
gggggacagc 
ggtgggccgc 
cgagccgcag 
gaacaaggtg 
gctccagtgg 
gcgtaaacgg 
agagaaccag 
tggagctggt 
gaggaccagg 
gccgcactgg 
gtgggaaaca 
aacgggtacg 
gggaagagga 
gaagctatcg 
ccgttcaacg 
aaggtcgtgg 
tgcaagtcat 
tgcgcggtca 
atgttcaagt 
gaagtcaagg 
tacgtcagaa 
cccaagcggg 
gactacgcgg 
tttccctgcc 
gtcatggact 
aagcggacat 
gcttgttcgg 
atgactcaaa 
tgaaggcgtt 
acaacatcag 
caacggactt 
caaggcctac 
cgacgccgag 
agcagtcttc 
taagacggct 
ttcgggcatc 
tggcgacaca 
tggtgtggga 
gggtgccgat 
cgacagagtc 
ctacaagcaa 
caccccctgg 
gcaaagactc 
taacattcaa 
taccagcacg 



agctccattt 
gtcctgaagg 
aactgggtgg 
attgagcagg 
cgccgcgtga 
tacttccacc 
tacgtgagcc 
cttccgaact 
gtggacgact 
gcgtggacta 
ctggtggcgc 
aaccnaattc 
cgggtggctg 
cctcttacat 
acaatgcctc 
acccgccgga 
atccgcagta 
acaccatctg 
cccacgccgt 
attgcgtcga 
agtccgccaa 
cggcccagat 
tcgacggaaa 
tcgagctcac 
actttttccg 
agggtggagc 
cctgtccgtc 
acaggtacca 
ggcaatgcga 
gtgccgagtg 
atcagaaact 
cctgcgatct 
ccagatatga 
cgagagtggt 
gacaacgctc 
gacaaggggg 
gaccagcagc 
tttcaggagc 
caggccaaaa 
cctggaaaaa 
ggcaaatcag 
gagtcagtcc 
tctactacaa 
ggagtgggta 
atcaccacca 
atctccagcc 
gggtattttg 
atcaacaaca 
gtcaaagagg 
gttcaggtgt 



tgcccgcgaa 
tgcccagcga 
cggagaagga 
cacccctaac 
gtaaggcccc 
tacacattct 
agattaaaga 
ggttcgcggt 
gctacatccc 
atatggacca 
agcatctgac 
tgacgcgccg 
gtggacccng 
ctccttcaac 
cnaaattatg 
ggacattact 
cgcggcctcc 
gctctttggg 
gcccttttac 
caagatggtg 
ggccattctg 
cgacccaact 
ttccaccacc 
caagcgcctg 
gtgggcgtca 
tagaaagagg 
agttgcgcag 
aaacaaatgt 
gagaatgaat 
cttccccgtg 
gtgtccgatt 
ggccaatgtg 
ctgacggtta 
gggcgctgca 
ggggtcttgt 
aacccgtcaa 
tcaaggccgg 
gtcttcaaga 
agaggatcct 
agagacctgt 
gccagcagcc 
cagaccctca 
tggcttcagg 
attcctcagg 
gcacccgcac 
aatcaggagc 
acttcaacag 
actggggatt 
tcacgcagaa 
ttactgactc 
Page 



ttttgaacga 
cctggacgag 
atgggagctg 
cgtggccgaa 
ggaggccctc 
ggtggagacc 
gaagctggtg 
gaccaagacg 
caactacctg 
gtatttaagc 
gcacgtgtcg 
gtgattcgat 
ggatcncgtc 
gccgcctcca 
agcctgacaa 
canaaccgga 
gtcttcctgg. 
ccggccacga 
ggctgcgtga 
atctggtggg 
ggcggaagca 
cccgtcatcg 
ttcgagcacc 
gagcacgact 
gatcacgtga 
cccgccccca 
ccatcgacgt 
tctcgtcacg 
cagaatgtgg 
tcagaatctc 
catcacatca 
gacttggatg 
ccttccagat 
acctggagcc 
gcttccgggt 
cgcagcggac 
tgacaacccc 
agatacgtct 
tgagcctctg 
agagcaatct 
cgctagaaaa 
accactcgga 
cggtggcgca 
aaattggcat 
ctgggccctg 
caccaacgac 
attccactgc 
ccgacccaag 
tgacggtacg 
cgagtaccag 
4 



gcagcagcca 
cacctgcctg 
ccgccggatt 
aagctgcaac 
ttctttgttc 
gtgggcgtga 
acccgcatct 
cgtaatggcg 
ctccccaaga 
gcctgtttga 
cagacgcagg 
caaaacctcc 
agaaaagcaa 
actcgcggtc 
aaacggctcc 
tctacaaaat 
gctgggcgca 
cgggtaaaac 
actggaccaa 
aggagggcaa 
aggtgcgcgt 
tcacctccaa 
aacaaccact 
ttggcaaggt 
ctgaggtgtc 
atgacgcaga 
cagacgcgga 
tgggcatgaa 
acatttgctt 
aacccgtgtc 
tggggagggc 
actgtgacat 
tggctagagg 
cctaaaccca 
tacaaatacc 
gcggcagccc 
tacctcaagt 
tttgggggca 
ggtctggttg 
ccagcagaac 
agactgaatt 
caacctcccg 
ccaatggcag 
tgcgattccc 
cccacctaca 
aaccactact 
cacttttcac 
agactcaact 
acgacgattg 
ctcccgtacg 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
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tcctcggctc ggcgcatcag ggatgcctcc cgccgttccc agcagacgtc ttcatggtcc 3060 

cacagtatgg atacctcacc ctgaacaacg ggagtcaggc ggtaggacgc tcttcctttt 3120 

actgcctgga gtactttcct tctcagatgc tgcgtactgg aaacaacttt cagtttagct 3180 

acacttttga agacgtgcct ttccacagca gctacgctca cagccaaagt ctggaccgtc 3240 

tcatgaatcc tctgatcgac cagtacctgt actatctgaa caggacacaa acagccagtg 3300 

gaactcagca gtctcggcta ctgtttagcc aagctggacc caccagtatg tctcttcaag 3360 

ctaaaaactg gctgcctgga ccttgctaca gacagcagcg tctgtcaaag caggcaaacg 3420 

acaacaacaa cagcaacttt ccctggactg gtgccaccaa atatcatctg aatggccggg 3480 

actcattggt gaacccgggc cctgctatgg ccagtcacaa ggatgacaaa gaaaagtttt 3540 

tccccatgca tggaaccctg atatttggta aagaaggaac aaatgccaac aacgcggatt 3600 

tggaaaatgt catgattaca gatgaagaag aaatccgcac caccaatccc gtggctacgg 3660 

agcagtacgg gactgtgtca aataatttgc aaaactcaaa cgctggtcca actactggaa 3720 

ctgtcaatca ccaaggagcg ttacctggta tggtgtggca ggatcgagac gtgtacctgc 3780 

agggacccat ttgggccaag attcctcaca ccgatggaca ctttcatcct tctccactga 3840 

tgggaggttt tgggctcaaa cacccgcctc ctcagatcat gatcaaaaac actcccgttc 3900 

cagccaatcc tcccacaaac tttagtgcgg caaagtttgc ttccttcatc acacagtact 3960 

ccacggggca ggtcagcgtg gagatcgagt gggagctgca gaaggagaac agcaaacgct 4020 

ggaatcccga aattcagtac acttccaact acaacaaatc tgttaatgtg gactttactg 4080 

tggacactaa tggtgtgtat tcagagcctc gccccattgg caccagatac ctgactcgta 4140 

atctgtaatt gcttgttaat caataaaccg gttgattcgt ttcagttgaa ctttgg 4196 

<210> 4 
<211> 3005 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Note = 
synthethic construct 

<400> 4 

tggtgggagg agggcaagat gacggccaag gtcgtggagt ccgccaaggc cattctcggc 60 

ggcagcaagg tgcgcgtgga ccaaaagtgc aagtcgtccg cccagatcga tcccaccccc 120 

gtgatcgtca cctccaacac caacatgtgc gccgtgattg acgggaacag caccaccttc 180 

gagcaccagc agccgttgca ggaccggatg ttcaaatttg aactcacccg ccgtctggag 240 

catgactttg gcaaggtgac aaagcaggaa gtcaaagagt tcttccgctg ggcgcaggat 300 

cacgtgaccg aggtggcgca tgagttctac gtcagaaagg gtggagccaa caaaagaccc 360 

gcccccgatg acgcggataa aagcgagccc aagcgggcct gcccctcagt cgcggatcca 420 

tcgacgtcag acgcggaagg agctccggtg gactttgccg acaggtacca aaacaaatgt 480 

tctcgtcacg cgggcatgct tcagatgctg tttccctgca agacatgcga gagaatgaat 540 

cagaatttca acatttgctt cacgcacggg accagagact gttcagaatg tttccccggc 600 

gtgtcagaat ctcaaccggt cgtcagaaaa aggacgtatc ggaaactctg tgcgattcat 660 

catctgctgg ggcgggctcc cgagattgct tgctcggcct gcgatctggt caacgtggac 720 

ctggatgact gtgtctctga gcaataaatg acttaaacca ggtatggctg ccgatggtta 780 

tcttccagat tggctcgagg acaacctctc tgagggcatt cgcgagtggt gggacttgaa 840 

acctggagcc ccgaaaccca aagccaacca gcaaaagcag gacgacggcc ggggtctggt 900 

gcttcctggc tacaagtacc tcggaccctt caacggactc gacaaggggg agcccgtcaa 960 

cgcggcggac gcagcggccc tcgagcacga caaggcctac gaccagcagc tcaaagcggg 1020 

tgacaatccg tacctgcggt ataaccacgc cgacgccgag tttcaggagc gtctgcaaga 1080 

agatacgtct tttgggggca acctcgggcg agcagtcttc caggccaaga agcgggttct 1140 

cgaacctttt ggtctggttg aggaaggcgc taagacggct cctggaaaga aacgtccggt 1200 

agagcagtcg ccacaagagc cagactcctc ctcgggcatc ggcaagacag gccagcagcc 1260 

cgctaaaaag agactcaatt ttggtcagac tggcgactca gagtcagtcc ccgacccaca 1320 

acctctcgga gaacctccag caacccccgc tgctgtggga cctactacaa tggcttcagg 1380 

cggtggcgca ccaatggcag acaataacga aggcgccgac ggagtgggta atgcctcagg 1440 

aaattggcat tgcgattcca catggctggg cgacagagtc atcaccacca gcacccgcac 1500 

ctgggccttg cccacctaca ataaccacct ctacaagcaa atctccagtg cttcaacggg 1560 

ggccagcaac gacaaccact acttcggcta cagcaccccc tgggggtatt ttgatttcaa 1620 

cagattccac tgccactttt caccacgtga ctggcagcga ctcatcaaca acaattgggg 1680 

attccggccc aagagactca acttcaagct cttcaacatc caagtcaagg aggtcacgac 1740 

gaatgatggc gtcacaacca tcgctaataa ccttaccagc acggttcaag tcttctcgga 1800 

ctcggagtac cagcttccgt acgtcctcgg ctctgcgcac cagggctgcc tccctccgtt 1860 

cccggcggac gtgttcatga ttccgcaata cggctacctg acgctcaaca atggcagcca 1920 

Page 5 



WO 2006/119432 PCT/US2006/017157 



140140433Pl.TXT 

agccgtggga cgttcatcct tttactgcct ggaatatttc ccttctcaga tgctgagaac 1980 

gggcaacaac tttaccttca gctacacctt tgaggaagtg cctttccaca gcagctacgc 2040 

gcacagccag agcctggacc ggctgatgaa tcctctcatc gaccaatacc tgtattacct 2100 

gaacagaact caaaatcagt ccggaagtgc ccaaaacaag gacttgctgt ttagccgtgg 2160 

gtctccagct ggcatgtctg ttcagcccaa aaactggcta cctggaccct gttatcggca 2220 

gcagcgcgtt tctaaaacaa aaacagacaa caacaacagc aattttacct ggactggtgc 2280 

ttcaaaatat aacctcaatg ggegtgaatc catcatcaac cctggcactg ctatggcctc 2340 

acacaaagac gacgaagaca agttctttcc catgagcggt gtcatgattt ttggaaaaga 2400 

gagcgccgga gcttcaaaca ctgcattgga caatgtcatg attacagacg aagaggaaat 2460 

taaagccact aaccctgtgg ccaccgaaag atttgggacc gtggcagtca atttccagag 2520 

cagcagcaca gaccctgcga ccggagatgt gcatgttatg ggagcattac ctggcatggt 2580 

gtggcaagat agagacgtgt acctgcaggg tcccatctgg gccaaaattc ctcacacaga 2640 

tggacacttt cacccgtctc ctcttatggg cggctttgga ctcaagcacc cgcctcctca 2700 

gatcctcatc aaaaacacgc ctgttcctgc gaatcctccg gcggagtttt cagctacaaa 2760 

gtttgcttca ttcatcaccc aatactccac aggacaagtg agtgtggaaa ttgaatggga 2820 

gctgcagaaa gaaaacagca agcgctggaa tcccgaagtg cagtacacat ccaattatgc 2880 

aaaatctgcc aacgttgatt ttaccgtgga caacaatgga ctttatactg agcctcgccc 2940 

cattggcacc cgttacctta cccgtcccct gtaattacgt gttaatcaat aaaccgtttm 3000 

attcg 3005 

<210> 5 
<2U> 3014 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Description of Artificial Sequence: Note = 
synthethic construct 

<221> misc_feature 
<222> 6 

<223> N can be A, C, G, or T (U) 
<400> 5 

atgctnatgt ggtgggagga gggcaagatg acggccaagg tcgtggagtc cgccaaggcc 60 

attctcggcg gcagcaaagt gcgcgtggac caaaagtgca agtcgtccgc ccagatcgat 120 

cccacccccg tgatcgtcac ctccaacacc aacatgtgcg ccgtgattga cgggaacagc 180 

accaccttcg agcaccagca gccgttgcag gaccggatgt tcaaatttga actcacccgc 240 

cgtctggagc atgactttgg caaggtgaca aaacaggaag tcaaagagtt cttccgctgg 300 

gcgcaggatc acgtgaccga ggtggcgcat gagttctacg tcagaaaggg tggagccaac 360 

aagagacccg cccccgatga cgcggataaa agcgagccca agcgggtctg cccctcagtc 420 

gcggatccat cgacgtcaga cgcggaagga gctccggtgg actttgccga caggtaccaa 480 

aacaaatgtt ctcgtcacgc gggcatgctt cagatgctgt ttccctgcaa aacatgcgag 540 

agaatgaatc agaatttcaa catttgcttc acgcacggga ccagagactg ttcagaatgt 600 

ttccccggcg tgtcagaatc tcaaccggtc gtcagaaaaa ggacgtatcg gaaactctgt 660 

gccattcatc atctgctggg gcgggctccc gagattgctt gctcggcctg cgatctggtc 720 

aacgtggacc tggatgactg tgtttctgag caataaatga cttaaaccag gtatggctgc 780 

cgatggttat cttccagatt ggctcgagga caacctctct gagggcattc gcaagtggtg 840 

ggacttgaaa cctggagccc cgaaacccaa agccaaccag caaaagcagg acgacggccg 900 

gggtctggtg cttcctggct acaagtacct cggacccttc aacggactcg acaaggggga 960 

gcccgtcaac gcggcggacg cagcggccct cgagcacgac aaggcctacg accagcagct 1020 

caaagcgggt gacaatccgt acctgcggta taaccacgcc gacgccgagt ttcaggagcg 1080 

tctgcaagaa gatacgtctt ttgggggcaa cctcgggaga gcagtcttcc aggccaagaa 1140 

gcgggttctc gaaccttttg gtctggttga ggaaggcgct aagacggctc ctggaaagaa 1200 

acgtccggta gagcagtcgc cacaagagcc agactcctcc tcgggcatcg gcaagacagg 1260 

ccagcagccc gctaaaaaga gactcaattt tggtcagact ggcgactcag agtcagtccc 1320 

cgacccacaa cctctcggag aacctccagc aacccccgct gctgtgggac ctactacaat 1380 

ggcttcaggc ggtggcgcac caatggcaga caataacgaa ggcgccgacg gagtgggtaa 1440 

tgcctcagga aattggcatt gcgattccac atggctgggc gacagagtca tcaccaccag 1500 

cacccgcacc tgggccttgc ccacctacaa taaccacctc tacaagcaaa tctccagtgc 1560 

ttcaacgggg gccagcaacg acaaccacta cttcggctac agcaccccct gggggtattt 1620 

tgatttcaac agattccact gccacttttc accacgtgac tggcagcgac tcatcaacaa 1680 

caattgggga ttccggccca agagactcaa cttcaagctc ttcaacatcc aagtcaagga 1740 
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ggtcacgacg aatgatggcg tcacaaccat cgctaataac cttaccagca 
cttctcggac tcggagtacc agcttccgta cgtcctcggc tctgcgcacc 
ccctccgttc ccggcggacg tgttcatgat tccgcaatac ggctacctga 
tggcagccaa gccgtgggac gttcatcctt ttactgcctg gaatatttcc 
gctgagaacg ggcaacaact ttaccttcag ctacaccttt gaggaagtgc 
cagctacgcg cacagccaga gcctggaccg gctgatgaat cctctcatcg 
gtattacctg aacagaactc aaaatcagtc cggaagtgcc caaaacaagg 
tagccgtggg tctccagctg gcatgtctgt tcagcccaaa aactggctac 
ttatcggcag cagcgcgttt ctaaaacaaa aacagacaac aacaacagca 
gactggtgct tcaaaatata acctcaatgg gcgtgaatcc atcatcaacc 
tatggcctca cacaaagacg acaaagacaa gttctttccc atgagcggtg 
tggaaaagag agcgccggag cttcaaacac tgcattggac aatgtcatga 
agaggaaatt aaagccacta accctgtggc caccgaaaga tttgggaccg 
tttccagagc agcagcacag accctgcgac cggagatgtg catgttatgg 
tggcatggtg tggcaagata gagacgtgta cctgcagggt cccatttggg 
tcacacagat ggacactttc acccgtctcc tcttatgggc ggctttggac 
gcctcctcag atcctcatca aaaacacacc tgttcctgcg aatcctccgg 
agctacaaag tttgcttcat tcatcaccca atactccaca ggacaagtga 
tgaatgggag ctgcagaaag aaaacagcaa gcgctggaat cccgaagtgc 
caattatgca aaatctgcca acgttgattt taccgtggac aacaatggac 
gcctcgcccc attggcaccc gttaccttac ccgtcccctg taattacgtg 
aaccgkttha ttcg 

<210> 6 
<211> 3010 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 
synthethic construct 



cggttcaagt 
agggctgcct 
cgctcaacaa 
cttctcagat 
ctttccacag 
accaatacct 
acttgctgtt 
ctggaccctg 
attttacctg 
ctggcactgc 
tcatgatttt 
ttacagacga 
tggcagtcaa 
gagcattacc 
ccaaaattcc 
tcaagcaccc 
cggagttttc 
gtgtggaaat 
agtacacatc 
tttatactga 
ttaatcaata 



Note 



<400> 6 

tgatttggtg 

tcggcggcag 

cccccgtgat 

ccttcgagca 

tggagcatga 

aggatcacgt 

gacccgcccc 

atccatcgac 

aatgttctcg 

tgaatcagaa 

ccggcgtgtc 

ttcatcatct 

tggacctgga 

ggttatcttc 

ttgaaacctg 

ctggtgcttc 

gtcaacgcgg 

gcgggtgaca 

caagaagata 

gttctcgaac 

ccggtagagc 

cagcccgcta 

ccacaacctc 

tcaggcggtg 

tcaggaaatt 

cgcacctggg 

acgggggcca 

ttcaacagat 

tggggattcc 

acgacgaatg 



ggaggagggc 
caaggtgcgc 
cgtcacctcc 
ccagcagccg 
ctttggcaag 
gaccgaggtg 
cgatgacgcg 
gtcagacgcg 
tcacgcgggc 
tttcaacatt 
agaatctcaa 
gctggggcgg 
tgactgtgtc 
cagattggct 
gagccccgaa 
ctggctacaa 
cggacgcagc 
atccgtacct 
cgtcttttgg 
cttttggtct 
agtcgccaca 
aaaagagact 
tcggagaacc 
gcgcaccaat 
ggcattgcga 
ccttgcccac 
gcaacgacaa 
tccactgcca 
ggcccaagag 
atggcgtcac 



aagatgacgg 
gtggaccaaa 
aacaccaaca 
ttgcaggacc 
gtgacaaagc 
gcgcatgagt 
gataaaagcg 
gaaggagctc 
atgcttcaga 
tgcttcacgc 
ccggtcgtca 
gctcccgaga 
tctgagcaat 
cgaggacaac 
acccaaagcc 
gtacctcgga 
ggccctcgag 
gcggtataac 
gggcaacctc 
ggttgaggaa 
agagccagac 
caattttggt 
tccagcaacc 
ggcagacaat 
ttccacatgg 
ctacaataac 
ccactacttc 
cttttcacca 
actcaacttc 
aaccatcgct 



ccaaggtcgt 
agtgcaagtc 
tgtgcgccgt 
ggatgttcaa 
aggaagtcaa 
tctacgtcag 
agcccaagcg 
cggtggactt 
tgctgtttcc 
acgggaccag 
gaaaaaggac 
ttgcttgctc 
aaatgactta 
ctctctgagg 
aaccagcaaa 
cccttcaacg 
cacgacaagg 
cacgccgacg 
gggcgagcag 
ggcgctaaga 
tcctcctcgg 
cagactggcg 
cccgctgctg 
aacgaaggcg 
ctgggcgaca 
cacctctaca 
ggctacagca 
cgtgactggc 
aagctcttca 
aataacctta 
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ggagtccgcc 
gtccgcccrg 
gattgacggg 
atttgaactc 
agagttcttc 
aaagggtgga 
ggcctgcccc 
tgccgacagg 
ctgcaagaca 
agactgttca 
gtatcggaaa 
ggcctgcgat 
aaccaggtat 
gcattcgcga 
agcaggacaa 
gactcgacaa 
cctacgacca 
ccgagtttca 
tcttccaggc 
cggctcctgg 
gcatcggcaa 
actcagagtc 
tgggacctac 
ccgacggagt 
gagtcatcac 
agcaaatctc 
ccccctgggg 
agcgactcat 
acatccaagt 
ccagcacggt 
7 



aaagccattc 
atcgatccca 
aacagcacca 
acccgccgtc 
cgctgggcgc 
gccaacaaaa 
tcagtcgcgg 
taccaaaaca 
tgcgagagaa 
gaatgtttcc 
ctctgtgcga 
ctggtcaacg 
ggctgccgat 
gtggtgggac 
cggccggggt 
gggggagccc 
gcagctcaaa 
ggagcgtctg 
caagaagcgg 
aaagaaacgt 
gacaggccag 
agtccccgac 
tacaatggct 
gggtaatgcc 
caccagcacc 
cagtgcttca 
gtattttgat 
caacaacaat 
caaggaggtc 
tcaagtcttc 



1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3014 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
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tcggactcgg 
ccgttcccgg 
agccaagccg 
agaacgggca 
tacgcgcaca 
tacctgaaca 
cgtgggtctc 
cggcagcagc 
ggtgcttcaa 
gcctcacaca 
aaagagagcg 
gaaattaaag 
cagagcagca 
atggtgtggc 
acagatggac 
cctcagatcc 
acaaagtttg 
tgggagctgc 
tatgcaaaat 
cgccccattg 
gtttyattcg 



agtaccagct 
cggacgtgtt 
tgggacgttc 
acaactttac 
gccagagcct 
gaactcaaaa 
cagctggcat 
gcgtttctaa 
aatataacct 
aagacgacga 
ccggagcttc 
ccactaaccc 
gcacacaccc 
aagatagaga 
actttcaccc 
tcatcaaaaa 
cttcattcat 
agaaagaaaa 
ctgccaacgt 
gcacccgtta 



tccgtacgtc 
catgattccg 
atccttttac 
cttcagctac 
ggaccggctg 
tcagtccgga 
gtctgttcag 
aacaaaaaca 
caatgggcgt 
agacaagttc 
aaacactgca 
tgtggccacc 
tgcgaccgga 
cgtgtacctg 
gtctcctctt 
cacgcctgtt 
cacccaatac 
cagcaagcgc 
tgattttacc 
ccttacccgt 



140140433P1.TXT 
ctcggctctg cgcaccaggg 
caatacggct acctgacgct 
tgcctggaat atttcccttc 
acctttgagg aagtgccttt 
atgaatcctc tcatcgacca 
agtgcccaaa acaaggactt 
cccaaaaact ggctacctgg 
gacaacaaca acagcaattt 
gaatccatca tcaaccctgg 
tttcccatga gcggtgtcat 
ttggacaatg tcatgattac 
gaaagatttg ggaccgtggc 
gatgtgcatg ttatgggagc 
cagggtccca tctgggccaa 
atgggcggct ttggactcaa 
cctgcgaatc ctccggcgga 
tccacaggac aagtgagtgt 
tggaatcccg aagtgcagta 
gtggacaaca atggacttta 
cccctgtaat tacgtgttaa 



ctgcctccct 
caacaatggc 
tcagatgctg 
ccacagcagc 
atacctgtat 
gctgtttagc 
accctgttat 
tacctggact 
cactgctatg 
gatttttgga 
agacgaagag 
agtcaatttc 
attacctggc 
aattcctcac 
gcacccgcct 
gttttcagct 
ggaaattgaa 
cacatccaat 
tactgagcct 
tcaataaacc 



<210> 7 
<211> 3008 
<212> DNA 
<213> Artificial 



Sequence 



<220> 

<223> Description of Artificial Sequence: 
synthethic construct 



Note = 



<400> 7 

atytggtggg 

ggcggcagca 

cccgtgatcg 

ttcgagcacc 

gagcacgact 

gatcacgtga 

cccgcccccg 

ccatcgacgt 

tgttctcgtc 

aatcagaatt 

ggcgtgtcag 

catcatctgc 

gacatggatg 

ttatcttcca 

gaaacctgga 

ggtgcttcct 

caacgcggcg 

gggtgacaat 

agaagatacg 

tctcgaacct 

ggtagagcag 

gcccgctaaa 

acaacctctc 

aggcggtggc 

aggaaattgg 

cacctgggcc 

gggggccagc 

caacagattc 

gggattccgg 

gacgaatgat 

ggactcggag 



aggagggcaa 
aggtgcgcgt 
tcacctccaa 
agcagccgtt 
ttggcaaggt 
ccgaggtggc 
atgacgcgga 
cagacgcgga 
acgcgggcat 
tcaacatttg 
aatctcaacc 
tggggcgggc 
rctgtgtttc 
gattggctcg 
gccccgaaac 
ggctacaagt 
gacgcagcgg 
ccgtacctgc 
tcttttgggg 
tttggtctgg 
tcgccacaag 
aagagactca 
ggagaacctc 
gcaccaatgg 
cattgcgatt 
ttgcccacct 
aacgacaacc 
cactgccact 
cccaagagac 
ggcgtcacaa 
taccagcttc 



gatgacggcc 
ggaccaaaag 
caccaacatg 
gcaggaccgg 
gacaaagcag 
gcatgagttc 
taaaagcgag 
aggagctccg 
gcttcagatg 
cttcacgcac 
ggtcgtcaga 
tcccgagatt 
tgagcaataa 
aggacaacct 
ccaaagccaa 
acctcggacc 
ccctcgagca 
ggtataacca 
gcaacctcgg 
ttgaggaagg 
agccagactc 
attttggtca 
cagcaacccc 
cagacaataa 
ccacatggct 
acaataacca 
actacttcgg 
tttcaccacg 
tcaacttcaa 
ccatcgctaa 
cgtacgtcct 



aaggtcgtgg 
tgcaagtcgt 
tgcgccgtga 
atgttcaaat 
gaagtcaaag 
tacgtcagaa 
cccaagcggg 
gtggactttg 
ctgtttccct 
gggaccagag 
aaaaggacgt 
gcttgctcgg 
atgacttaaa 
ctctgagggc 
ccagcaaaag 
cttcaacgga 
cgacaaggcc 
cgccgacgcc 
gcgagcagtc 
cgctaagacg 
ctcctcgggc 
gactggcgac 
cgctgctgtg 
cgaaggcgcc 
gggcgacaga 
cctctacaag 
ctacagcacc 
tgactggcag 
gctcttcaac 
taaccttacc 
cggctctgcg 
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agtccgccaa 
ccgcccagat 
ttgacgggaa 
ttgaactcac 
agttcttccg 
agggtggagc 
cctgcccctc 
ccgacaggta 
gcaaaacatg 
actgttcaga 
atcggaaact 
cctgcgatct 
ccaggtatgg 
attcgcgagt 
caggacgacg 
ctcgacaagg 
tacgaccagc 
gagtttcagg 
ttccaggcca 
gctcctggaa 
atcggcaaga 
tcagagtcag 
ggacctacta 
gacggagtgg 
gtcatcacca 
caaatctcca 
ccctgggggt 
cgactcatca 
atccaagtca 
agcacggttc 
caccagggct 
8 



ggccattctc 
cgatcccacc 
cagcaccacc 
ccgccgtctg 
ctgggcgcag 
caacaagaga 
agtcgcggat 
ccaaaacaaa 
cgagagaatg 
gtgctttccc 
ctgtgccatt 
ggtcaacgtg 
ctgccgatgg 
ggtgggactt 
gccggggtct 
gggagcccgt 
agctcaaagc 
agcgtctgca 
aaaagcgggt 
agaaacgtcc 
caggccagca 
tccccgaccc 
caatggcttc 
gtaatgcctc 
ccagcacccg 
gtgcttcaac 
attttgattt 
acaacaattg 
aggaggtcac 
aagtcttctc 
gcctccctcc 



1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3010 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
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gttcccggcg gacgtgttca tgattccgca atacggctac ctgacgctca acaatggcag 1920 

ccaagccgtg ggacgttcat ccttttattg cctggaatat ttcccatcgc agatgctgag 1980 

aacgggcaat aactttacct tcagctacac ctttgaggac gtgcctttcc acagcagcta 2040 

cgcgcacagc cagagcctgg accggctgat gaatcctctc atcgaccagt acctgtatta 2100 

cctgaacaga actcagaacc agtccggaag tgcccaaaac aaggacttgc tgtttagccg 2160 

ggggtctcca gctggcatgt ctgttcagct caaaaactgg ctacctggac cctgttatcg 2220 

gcagcagcgc gtttctaaaa caaaaacaga caacaacaac agcaatttta cctggactgg 2280 

tgcttcaaaa tataacctta atgggcgtga atccatcatc aaccctggca ctgctatggc 2340 

ctcacacaaa gacgacgaag acaagttctt tcccatgagc ggtgtcatga tttttggaaa 2400 

agagagcgcc ggagcttcaa acactgcatt ggacaatgtc atgattacag acgaagagga 2460 

aattaaagcc actaaccctg tggccaccga aagatttggg accgtggcag tcaatttcca 2520 

gagcagcagc acagaccctg cgaccggaga tgtgcatgtt atgggagcat tacctggcat 2580 

ggtgtggcaa gatagagacg tgtacctgca gggtcccatt tgggccaaaa ttcctcacac 2640 

agatggacac tttcacccgt ctcctcttat gggcggcttt ggactcaaac acccgcctcc 2700 

tcagatcctc atcaaaaaca cacctgttcc tgcgaatcct ccggcggagt tttcagctac 2760 

aaagtttgct tcattcatca cccaatactc cacaggacaa gtgagcgtgg aaattgaatg 2820 

ggagctgcag aaagaaaaca gcaagcgctg gaatcccgaa gtgcagtaca catccaatta 2880 

tgcaaaatct gccaacgttg attttaccgt ggacaacaat ggactttata ctgagcctcg 2940 

ccccattggc acccgttacc ttacccgtcc cctgtaatta cgtgttaatc aataaaccgk 3000 

ttaattcg 3008 

<210> 8 
<211> 3021 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial sequence: Note = 
synthethic construct 

<400> 8 . 

atgctdatgt ggtgggagga gggcaagatg acggccaagg tcgtggagtc cgccaaggcc 60 

attctcggcg gcagcaaagt gcgcgtggac caaaagtgca agtcgtccgc ccagatcgac 120 

cccacccccg tcatcgtcac ctccaacacc aacatgtgcg ccgtgattga cgggaacagc 180 

accaccttcg agcaccagca gccgttgcag gaccggatgt tcaaatttga actcacccgc 240 

cgtctggagc acgactttgg caaggtgaca aagcaggaag tcaaagagtt cttccactgg 300 

gcgcaggatc acgtgaccga ggtggcgcat gagttctacg tcagaaaggg tggagccaac 360 

aagagacccg cccccgatga cgcggataaa agcgagccca agcgggcctg cccctcagtc 420 

gcggatccat cgacgtcaga cgcggaagga gctccggtgg actttgccga caggtaccaa 480 

aacaaatgtt ctcgtcacgc gggcatgctt cagatgctgt ttccctgcaa aacatgcgag 540 

agaatgaatc agaatttcaa catttgcttc acgcacggga ccagagactg ttcagaatgt 600 

ttccccggcg tgtcagaatc tcaaccggtc gtcagaaaaa agacgtatcg gaaactctgt 660 

gccattcatc atctgctggg gcgggctccc gagattgctt gctcggcctg cgatctggtc 720 

aatgtggacc tggatgactg tgtttctgag caataaatga cttaaaccag gtatggctgc 780 

cgatggttat cttccagatt ggctcgagga caacctctct gagggcattc gcgagtggtg 840 

ggacttgaaa cctggagccc cgaaacccaa agccaaccag caaaagcagg acgacggccg 900 

gggtctggtg cttcctggct acaagtacct cggacccttc aacggactcg acaaggggga 960 

gcccgtcaac gcggcggacg cagcggccct cgagcacgac aaggcctacg accagcagct 1020 

caaagcgggt gacaatccgt acctgcggta taaccacgcc gacgccgagt ttcaggagcg 1080 

tctgcaagaa gatacgtctt ttgggggcaa cctcgggcga gcagtcttcc aggccaagaa 1140 

gcgggttctc gaaccttttg gtctggttga ggaaggtgct aagacggctc ctggaaagaa 1200 

gagaccggta gagcagtcgc cccaagaacc agactcctca tcgggcatcg gcaaatcagg 1260 

ccagcagccc gctaaaaaga gactcaattt tggtcagact ggcgactcag agtcagtccc 1320 

cgacccacaa cctctcggag aacctccagc aacccccgct gctgtgggac ctactacaat 1380 

ggcttcaggc ggtggcgcac caatggcaga caataacgaa ggcgccgacg gagtgggtaa 1440 

tgcctcagga aattggcatt gcgattccac atggctgggc gacagagtca ttaccaccag 1500 

cacccgaacc tgggccctgc ccacctataa caaccacctc tacaaacaaa tctccagcgc 1560 

ttcaacgggg gccagcaacg acaaccacta cttcggctac agcaccccct gggggtattt 1620 

tgattttaac agattccact gccacttctc accacgtgac tggcagcgac tcatcaacaa 1680 

caattgggga ttccggccca agagactcaa cttcaagctc ttcaacatcc aagtcaagga 1740 

ggtcacgacg aacgatggcg tcacgaccat cgctaataac cttaccagca cggttcaagt 1800 

cttctcggac tcggagtacc agttgccgta cgtcctcggc tctgcgcacc agggctgcct 1860 

ccctccgttc ccggcggacg tgttcatgat tccgcagtac ggctacctaa cactcaacaa 1920 
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tggcagccag gccgtgggac gttcatcctt ttactgcctg gaatatttcc catcgcagat 1980 

gctgagaacg ggcaataact ttaccttcag ctacacattc gaggacgtgc ctttccacag 2040 

cagctacgcg cacagccaaa gcctggaccg gctgatgaat cctctcatcg accagtactt 2100 

gtattaccta aacagaactc aaaatcagtc cggaagtgcc caaaacaagg acttgctgtt 2160 

tagccggggg tctccagctg gcatgtctgt tcagcccaaa aactggctac ctggaccctg 2220 

ttatcggcag cagcgcgttt ctaaaacaaa aacagacaac aacaacagca actttacctg 2280 

gactggtgct tcaaaatata accttaatgg gcgtgaatct ataatcaacc ctggcactgc 2340 

tatggcttca cacaaagacg acgaagacaa gttctttcca atgagcggtg tcatgatttt 2400 

tggcaaggag agcgccggag cttcaaacac tgcattggac aatgtcatga ttacagacga 2460 

agaggaaatt aaagccacta accctgtggc caccgaaaga tttgggaccg tggcagtcaa 2520 

tttccagagc agcagcacag accctgcgac cggagatgtg catgttatgg gagcattacc 2580 

tggcatggtg tggcaagata gagacgtgta cctgcagggt ccaatttggg ccaaaattcc 2640 

tcacacagat ggacactttc acccgtctcc tcttatgggc ggctttggac ttaagcaccc 2700 

gcctcctcag atcctcatca aaaacacgcc tgttcctgcg aatcctccgg cagagttttc 2760 

ggctacaaag tttgcttcat tcatcaccca gtattccaca ggacaagtga gtgtggaaat 2820 

tgaatgggag ttgcagaaag aaaacagcaa gcgttggaat cccgaagtgc agtacacatc 2880 

taattatgca aaatctgcca acgttgattt cactgtggac aacaatggac tttatactga 2940 

gcctcgcccc attggcaccc gttacctcac ccgtcccctg taattacttg ttaatcaata 3000 

aaccgtgtha ttcgtgtcag t . " 3021 

<210> 9 
<211> 4222 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Note = 
synthethic construct 

<400> 9 

ttgcgacatt ttgcgacacc atgtggccat tcagggtata tatggccgag tgagcgagca 60 

ggatctccat tttgaccgcg aaatttgaac gagcagcagc catgccgggc ttctacgaga 120 

tcgtgatcaa ggtgccgagc gacctggacg agcacctgcc gggcatttct gactcgtttg 180 

tgaactgggt ggccgagaag gaatgggagc tgcccccgga ttctgacatg gatctgaatc 240 

tgattgagca ggcacccctg accgtggccg agaagctgca gcgcgacttc ctggtccaat 300 

ggcgccgcgt gagtaaggcc ccggaggccc tcttctttgt tcagttcgag aagggcgagt 360 

cctacttcca cctccatatt ctggtggaga ccacgggggt caaatccatg gtgctgggcc 420 

gcttcctgag tcagattagg gacaagctgg tgcagaccat ctaccgcggg atcgagccga 480 

ccctgcccaa ctggttcgcg gtgaccaaga cgcgtaatgg cgccggaggg gggaacaagg 540 

tggtggacga gtgctacatc cccaactacc tcctgcccaa gactcagccc gagctgcagt 600 

gggcgtggac taacatggag gagtatataa gcgcgtgttt gaacctggcc gagcgcaaac 660 

ggctcgtggc gcagcacctg acccacgtca gccagaccca ggagcagaac aaggagaatc 720 

tgaacccgaa ttctgacgcg cctgtcatcc ggtcaaaaac ctccgcgcgc tacatggagc 780 

tggtcgggtg gctggtggac cggggcatca cctccgagaa gcagtggatc caggaggacc 840 

aggcctcgta catctccttc aacgccgcct ccaactcgcg gtcccagatc aaggccgctc 900 

tggacaatgc cggcaagatc atggcgctaa ccaaatccgc gcccgactac ctggtaggcc 960 

ccgctccgcc cgcggacatt aaaaccaacc gcatttaccg catcctggag ctgaacggct 1020 

acgaccctgc ctacgccggc tccgtctttc tcggctgggc ccagaaaagg ttcgggaagc 1080 

gcaacaccat ctggctgttt gggccggcca ccacgggcaa gaccaacatc gcggaagcca 1140 

tcgcacacgc cgtgcccttc tacggctgcg tcaactggac caatgaaaac tttcccttca 1200 

acgactgcgt cgacaagatg gtgatctggt gggaggaggg caagatgacg gccaaggtcg 1260 

tggagtccgc caaggccatt ctcggcggca gcaaggtgcg cgtggaccaa aagtgcaagt 1320 

cgtccgccca gatcgatccc acccccgtga tcgtcacctc caacaccaac atgtgcgccg 1380 

tgattgacgg gaacagcacc accttcgagc accagcagcc gttgcaggac cggatgttca 1440 

aatttgaact cacccgccgt ctggagcacg actttggcaa ggtgacaaag caggaagtca 1500 

aagagttctt ccgctgggcg caggatcacg tgaccgaggt ggcgcatgag ttctacgtca 1560 

gaaagggtgg agccaacaag agacccgccc ccgatgacgc ggataaaagc gagcccaagc 1620 

gggtctgccc ctcagtcgcg gatccatcga cgtcagacgc ggaaggagct ccggtggact 1680 

ttgccgacag gtaccaaaac aaatgttctc gtcacgcggg catgcttcag atgctgtttc 1740 

cctgcaaaac atgcgagaga atgaatcaga atttcaacat ttgcttcacg cacgggacca 1800 

gagactgttc agaatgtttc cccggcgtgt cagaatctca accggtcgtc agaaaaagga 1860 

cgtatcggaa actctgtgcc attcatcatc tgctggggcg ggctcccgag attgcttgct 1920 

cggcctgcga tctggtcaac gtggacctgg atgactgtgt ttctgagcaa taaatgactt 1980 
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aaaccaggta tggctgccga tggttatctt ccagattggc tcgaggacaa cctctctgag 2040 

ggcattcgcg agtggtggga cttgaaacct ggagccccga aacccaaagc caaccagcaa 2100 

aagcaggacg acggccgggg tctggtgctt cctggctaca agtacctcgg acccttcaac 2160 

ggactcgaca agggggagcc cgtcaacgcg gcggacgcag cggccctcga gcacgacaag 2220 

gcctacgacc agcagctcaa agcgggtgac aatccgtacc tgcggtataa ccacgccgac 2280 

gccgagtttc aggagcgtct gcaagaagat acgtcttttg ggggcaacct cgggcgagca 2340 

gtcttccagg ccaagaagag ggttctcgaa ccttttggtc tggttgagga aggtgctaag 2400 

acggctcctg gaaagaaacg tccggtagag cagtcacccc aagaaccaga ctcctcctcg 2460 

ggcattggca aatcaggcca gcagcccgct aaaaagagac tcaattttgg tcagactggc 2520 

gactcagagt cagtccccga cccacaacct ctcggagaac ctccagcaac ccccgctgct 2580 

ttgggaccta ctacaatggc ttcaggcggt ggcgcaccaa tggcagacaa taacgaaggc 2640 

gccgacggag tgggtaatgc ctcaggaaat tggcattgcg attccacatg gctgggcgac 2700 

agagtcatca ccaccagcac ccgcacctgg gccttgccca cctacaataa ccacctctac 2760 

aagcaaatct ccagtgcttc aacgggggcc agcaacgaea accactactt cggctacagc 2820 

accccctggg ggtattttga tttcaacaga ttccactgcc atttctcacc acgtgactgg 2880 

cagcgactca tcaacaacaa ctggggattc cggcccaaga gactcaactt caagctcttc 2940 

aacatccaag tcaaggaggt cacgacgaac gatggcgtca cgaccatcgc taataacctt 3000 

accagcacgg ttcaagtctt ctcggactcg gagtaccagc ttccgtacgt cctcggctct 3060 

gcgcaccagg gctgcctccc tccgttcccg gcggacgtgt tcatgattcc gcagtacggc 3120 

tacctgacgc tcaacaatgg cagcaaagcc gtgggacgtt catcctttta ctgcctggaa 3180 

tatttccctt ctcagatgct gagaacgggc aacaacttta ccttcagcta cacctttgag 3240 

gaagtgcctt tccacagcag ctacgcgcac agccagagcc tggaccggct gatgaatcct 3300 

ctcatcgacc aatacctgta ttacctgaac agaactcaaa atcagtccgg aagtgcccaa 3360 

aacaaggact tgctgtttag ccgtgggtct ccagctggca tgtctgttca gcccaaaaac 3420 

tggctacctg gaccctgtta tcggcagcag cgcgtttcta aaacaaaaac agacaacaac 3480 

aacagcaatt ttacctggac tggtgcttca aaatataacc tcaatgggcg tgaatccatc 3540 

atcaaccctg gcactgctat ggcctcacac aaagacgacg aagacaagtt ctttcccatg 3600 

agcggtgtca tgatttttgg aaaagagagc gccggagctt caaacactgc attggacaat 3660 

gtcatgatta cagacgaaga ggaaattaaa gccactaacc ctgtggccac cgaaagattt 3720 

gggaccgtgg cagtcaattt ccagagcagc agcacagacc ctgcgaccgg agatgtgcat 3780 

gttatgggag cattacctgg catggtgtgg caagatagag acgtgtacct gcagggtccc 3840 

atttgggcca aaattcctca cacggatgga cactttcacc cgtctcctct tatgggcggc 3900 

tttggactca agcacccgcc tcctcagatc ctcatcaaaa acacacctgt tcctgcgaat 3960 

cctccggcag agttttcggc tacaaagttt gcttcattca tcacccagta ctccacagga 4020 

caagtgagcg tggaaattga atgggagttg cagaaagaaa acagtaagcg ctggaatccc 4080 

gaagtgcagt acacatctaa ttatgcaaaa tctgccaacg ttgacttcac tgtggacaac 4140 

aatggacttt atactgagcc tcgccccatt ggcacccgtt acctcacccg tcccctgtaa 4200 

ttacttgtta atcaataaac eg 4222 

<210> 10 
<211> 4213 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Note = 
synthethic construct 

<400> 10 

ttgcgacagt ttgcgacacc atgtggtcac aagaggtata taaccgegag tgagccagcg 60 

aggagctcca ttttgcccgc gaagtttgaa cgagcagcag ccatgccggg gttctacgag 120 

gtggtgatca aggtgcccag cgacctggac gagcacctgc ccggcatttc tgactccttt 180 

gtgaactggg tggccgagaa ggaatgggag ttgcccccgg attctgacat ggatcagaat 240 

ctgattgagc aggcacccct gaccgtggcc gagaagctgc agegegagtt cctggtggaa 300 

tggegecgag tgagtaaatt tetggaggee aagttttttg tgcagtttga aaagggggac 360 

tegtacttte atttgeatat tctgattgaa attaceggeg tgaaatccat ggtggtgggc 420 

cgctacgtga gtcagattag ggataaactg atccagcgca tctaccgcgg ggtcgagccc 480 

cagctgccca actggttcgc ggtcacaaag acccgaaatg gegceggagg egggaacaag 540 

Qtggtggacg agtgctacat ccccaactac ctgctcccca aggtccagcc cgagcttcag 600 

tgggcgtgga ctaacatgga ggagtatata agcgcctgtt tgaacctcgc ggagcgtaaa 660 

cggctcgtgg cgcagcacct gacgcacgtc tcccagaccc aggagggega caaggagaat 720 

ctgaacccga attctgaege geeggtgate eggtcaaaaa cctccgccag gtacatggag 780 

ctggtcgggt ggctggtgga caagggcatc aegtccgaga agcagtggat ccaggaggac 840 
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caggcctcgt acatctcctt caacgcggcc tccaactccc ggtcgcagat caaggcggcc 900 

ctggacaatg cctccaaaat catgagcctc accaaaacgg ctccggacta tctcatcggg 960 

cagcagcccg tgggggacat taccaccaac cggatctaca aaatcctgga actgaacggg 1020 

tacgaccccc agtacgccgc ctccgtcttt ctcggctggg cccagaaaaa gtttggaaag 1080 

cgcaacacca tctggctgtt tgggcccgcc accaccggca agaccaacat cgcggaagcc 1140 

atcgcccacg cggtcccctt ctacggctgc gtcaactgga ccaatgagaa ctttcccttc 1200 

aacgactgcg tcgacaaaat ggtgatttgg tgggaggagg gcaagatgac cgccaaggtc 1260 

gtagagtccg ccaaggccat tctgggcggc agcaaggtgc gcgtggacca aaaatgcaag 1320 

gcctctgcgc agatcgaccc cacccccgtg atcgtcacct ccaacaccaa catgtgcgcc 1380 

gtgattgacg ggaacagcac caccttcgag caccagcagc ccctgcagga ccggatgttc 1440 

aagtttgaac tcacccgccg cctcgaccac gactttggca aggtcaccaa gcaggaagtc 1500 

aaggactttt tccggtgggc ggctgatcac gtgactgacg tggctcatga gttttacgtc 1560 

acaaagggtg gagctaagaa aaggcccgcc ccctctgacg aggatataag cgagcccaag 1620 

cggccgcgcg tgtcatttgc gcagccggag acgtcagacg cggaagctcc cggagacttc 1680 

gccgacaggt accaaaacaa atgttctcgt cacgcgggta tgctgcagat gctctttccc 1740 

tgcaagacgt gcgagagaat gaatcagaat tccaacgtct gcttcacgca cggtcagaaa 1800 

gattgcgggg agtgctttcc cgggtcagaa tctcaaccgg tttctgtcgt cagaaaaacg 1860 

tatcagaaac tgtgcatcct tcatcagctc cggggggcac ccgagatcgc ctgctctgct 1920 

tgcgaccaac tcaaccccga tttggacgat tgccaatttg agcaataaat gactgaaatc 1980 

aggtatggct gctgacggtt atcttccaga ttggctcgag gacaacctct ctgaaggcat 2040 

tcgcgagtgg tgggcgctga aacctggagc tccacaaccc aaggccaacc aacagcatca 2100 

ggacaacggc aggggtcttg tgcttcctgg gtacaagtac ctcggaccct tcaacggact 2160 

cgacaaggga gagccggtca acgaggcaga cgccgcggcc ctcgagcacg acaaggccta 2220 

cgacaagcag ctcgagcagg gggacaaccc gtatctcaag tacaaccacg ccgacgccga 2280 

gttccagcag cgcttggcga ccgacacctc ttttgggggc aacctcgggc gagcagtctt 2340 

ccaggccaaa aagaggattc tcgagcctct gggtctggtt gaagagggcg ttaaaacggc 2400 

tcctggaaag aaacgcccat tagaaaagac tccaaatcgg ccgaccaacc cggactctgg 2460 

gaaggccccg gccaagaaaa agcaaaaaga cggcgaacca gccgactctg ctagaaggac 2520 

actcgacttt gaagactctg gagcaggaga cggaccccct gagggatcat cttccggaga 2580 

aatgtctcat gatgctgaga tgcgtgcggc gccaggcgga aatgctgtcg aggcgggaca 2640 

aggtgccgat ggagtgggta atgcctccgg tgattggcat tgcgattcca cctggtcaga 2700 

gggccgagtc accaccacca gcacccgaac ctgggtccta cccacgtaca acaaccacct 2760 

gtacctgcga atcggaacaa cggccaacag caacacctac aacggattct ccaccccctg 2820 

gggatacttt gactttaacc gcttccactg ccacttttcc ccacgcgact ggcagcgact 2880 

catcaacaac aactggggac tcaggccgaa atcgatgcgt gttaaaatct tcaacataca 2940 

ggtcaaggag gtcacgacgt caaacggcga gactacggtc gctaataacc ttaccagcac 3000 

ggttcagatc tttgcggatt cgacgtatga actcccatac gtgatggacg ccggtcagga 3060 

ggggagcttt cctccgtttc ccaacgacgt ctttatggtt ccccaatacg gatactgcgg 3120 

agttgtcact ggaaaaaacc agaaccagac agacagaaat gccttttact gcctggaata 3180 

ctttccatcc caaatgctaa gaactggcaa caattttgaa gtcagttacc aatttgaaaa 3240 

agttcctttc cattcaatgt acgcgcacag ccagagcctg gacagaatga tgaatccttt 3300 

actggatcag tacctgtggc atctgcaatc gaccactacc ggaaattccc ttaatcaagg 3360 

aacagctacc accacgtacg ggaaaattac cactggagac tttgcctact acaggaaaaa 3420 

ctggttgcct ggagcctgca ttaaacaaca aaaattttca aagaatgcca atcaaaacta 3480 

caagattccc gccagcgggg gagacgccct tttaaagtat gacacgcata ccactctaaa 3540 

tgggcgatgg agtaacatgg ctcctggacc tccaatggca accgcaggtg ccggggactc 3600 

ggattttagc aacagccagc tgatctttgc cggacccaat ccgagcggta acacgaccac 3660 

atcttcaaac aatttgttgt ttacctcaga agaggagatt gccacaacaa acccacgaga 3720 

cacggacatg tttggacaga ttgcagataa taatcaaaat gccaccaccg cccctcacat 3780 

cgctaacctg gacgctatgg gaattgttcc cggaatggtc tggcaaaaca gagacatcta 3840 

ctaccagggc cctatttggg ccaaggtccc tcacacggac ggacactttc acccttcgcc 3900 

gctgatggga ggatttggac tgaaacaccc gcctccacag attttcatca aaaacacccc 3960 

cgtacccgcc aatcccaata ctacctttag cgctgcaagg attaattctt ttctgacgca 4020 

gtacagcacc ggacaagttg ccgttcagat cgactgggaa attcagaagg agcattccaa 4080 

acgctggaat cccgaagttc aatttacttc aaactacggc actcaaaatt ctatgctgtg 4140 

ggctcccgac aatgctggca actaccacga actccgggct attgggtccc gtttcctcac 4200 

ccaccacttg taa ~~ 4213 

<210> 11 
<211> 2211 
<212> DNA 

<213> Artificial Sequence 
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<220> 

<223> Description of Artificial Sequence: Note = 
synthethic construct 



<400> 11 

atggctgccg 

gagtggtggg 

gacggccggg 

aagggggagc 

cagcagctca 

caggagcgtc 

gccaagaagc 

ggaaagaaac 

aagacaggcc 

tcagtccccg 

actacaatgg 

gtgggtaatg 

accaccagca 

tccagtgctt 

gggtattttg 

atcaacaaca 

gtcaaggagg 

gttcaagtct 

ggctgcctcc 

ctcaacaatg 

tctcagatgc 

ttccacagca 

caatacctgt 

ttgctgttta 

ggaccctgtt 

tttacctgga 

ggcactgcta 

atgatttttg 

acagacgaag 

gcagtcaatt 

gcattacctg 

aaaattcctc 

aagcacccgc 

gagttttcag 

gtggaaattg 

tacacatcca 

tatactgagc 



atggttatct 
acttgaaacc 
gtctggtgct 
ccgtcaacgc 
aagcgggtga 
tgcaagaaga 
gggttctcga 
gtccggtaga 
agcagcccgc 
acccacaacc 
cttcaggcgg 
cctcaggaaa 
cccgcacctg 
caacgggggc 
atttcaacag 
attggggatt 
tcacgacgag 
tctcggactc 
ctccgttccc 
gcagccaagc 
tgagaacggg 
gctacgcgca 
attacctgaa 
gccgtgggtc 
atcggeagca 
ctggtgcttc 
tggcctcaca 
gaaaagagag 
aggaaattaa 
tccagagcag 
gcatggtgtg 
acacagatgg 
ctcctcagat 
ctacaaagtt 
aatgggagct 
attatgcaaa 
ctcgccccat 



tccagattgg 
tggagccccg 
tcctggctac 
ggcggacgca 
caatccgtac 
tacgtctttt 
accttttggt 
gcagtcgcca 
taaaaagaga 
tctcggagaa 
tggcgcacca 
ttggcattgc 
ggccttgccc 
cagcaacgac 
attccactgc 
ccggcccaag 
tgatggcgtc 
ggagtaccag 
ggcggacgtg 
cgtgggacgt 
caacaacttt 
cagccagagc 
cagaactcaa 
tccagctggc 
gcgcgtttct 
aaaatataac 
caaagacgac 
cgccggagct 
agccactaac 
cagcacacac 
gcaagataga 
acactttcac 
cctcatcaaa 
tgcttcattc 
gcagaaagaa 
atctgccaac 
tggcacccgt 



ctcgaggaca 
aaacccaaag 
aagtacctcg 
gcggccctcg 
ctgcggtata 

gggggcaacc 

ctggttgagg 
caagagccag 
ctcaattttg 
cctccagcaa 
atggcagaca 
gattccacat 
acctacaata 
aaccactact 
cacttttcac 
agactcaact 
acaaccatcg 
cttccgtacg 
ttcatgattc 
tcatcctttt 
accttcagct 
ctggaccggc 
aatcagtccg 
atgtctgttc 
aaaacaaaaa 
ctcaatgggc 
gaagacaagt 
tcaaacactg 
cctgtggcca 
cctgcgaccg 
gacgtgtacc 
ccgtctcctc 
aacacgcctg 
atcacccaat 
aacagcaagc 
gttgatttta 
taccttaccc 



acctctctga 
ccaaccagca 
gacccttcaa 
agcacgacaa 
accacgccga 
tcgggcgagc 
aaggcgctaa 
actcctcctc 
gtcagactgg 
cccccgctgc 
ataacgaagg 
ggctgggcga 
accacctcta 
tcggctacag 
cgcgtgactg 
tcaagctctt 
ctaataacct 
tcctcggctc 
cgcaatacgg 
actgcctgga 
acacctttga 
tgatgaatcc 
gaagtgccca 
agcccaaaaa 
caggcaacaa 
atgaatccat 
tctttcccat 
cattggacaa 
ccgaaagatt 
gagatgtgca 
tgcagggtcc 
ttatgggcgg 
ttcctgcgaa 
actccacagg 
gctggaatcc 
ccgtggacaa 
gtcccctgta 



gggcattcgc 
aaagcaggac 
cggactcgac 
ggcctacgac 
cgccgagttt 
agtcttccag 
gacggctcct 
gggcatcggc 
cgactcagag 
tgtgggacct 
cgccgacgga 
cagagtcatc 
caagcaaatc 
caccccctgg 
gcagcgactc 
caacatccaa 
taccagcacg 
tgcgcaccag 
ctacctgacg 
atatttccct 
ggaagtgcct 
tctcatcgac 
aaacaaggac 
ctggctacct 
caacagcaat 
catcaaccct 
gagcggtgtc 
tgtcatgatt 
tgggaccgtg 
tgttatggga 
catctgggcc 
ctttggactc 
tcctccggcg 
acaagtgagt 
cgaagtgcag 
caatggactt 
a 



<210> 12 
<211> 2211 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 
synthethic construct 



Note = 



<400> 12 

atggctgccg 

gagtggtggg 

gacggccggg 

aagggggagc 

cagcagctca 

caggagcgtc 

gccaagaagc 

ggaaagaaac 

aagacaggcc 

tcagtccccg 



atggttatct 
acttgaaacc 
gtctggtgct 
ccgtcaacgc 
aagcgggtga 
tgcaagaaga 
gggttctcga 
gtccggtaga 
agcagcccgc 
acccacaacc 



tccagattgg 
tggagccccg 
tcctggctac 
ggcggacgca 
caatccgtac 
tacgtctttt 
accttttggt 
gcagtcgcca 
taaaaagaga 
tctcggagaa 



ctcgaggaca 
aaacccaaag 
aagtacctcg 
gcggccctcg 
ctgcggtata 
gggggcaacc 
ctggttgagg 
caagagccag 
ctcaattttg 
cctccagcaa 
Page 



acctctctga 
ccaaccagca 
gacccttcaa 
agcacgacaa 
accacgccga 
tcgggcgagc 
aaggcgctaa 
actcctcctc 
gtcagactgg 
cccccgctgc 
13 



gggcattcgc 
aaagcaggac 
cggactcgac 
ggcctacgac 
cgccgagttt 
agtcttccag 
gacggctcct 
gggcatcggc 
cgactcagag 
tgtgggacct 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2211 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
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actacaatgg cttcaggcgg tggcgcacca atggcagaca ataacgaagg 
gtgggtaatg cctcaggaaa ttggcattgc gattccacat ggctgggcga 
accaccagca cccgcacctg ggccttgccc acctacaata accacctcta 
tccagtgctt caacgggggc cagcaacgac aaccactact tcggctacag 
gggtattttg atttcaacag attccactgc cacttttcac cgcgtgactg 
atcaacaaca attggggatt ccggcccaag agactcaact tcaagctctt 
gtcaaggagg tcacgacgag tgatggcgtc acaaccatcg ctaataacct 
gttcaagtct tctcggactc ggagtaccag cttccgtacg tcctcggctc 
ggctgcctcc ctccgttccc ggcggacgtg ttcatgattc cgcaatacgg 
ctcaacaatg gcagccaagc cgtgggacgt tcatcctttt actgcctgga 
tctcagatgc tgagaacggg caacaacttt accttcagct acacctttga 
ttccacagca gctacgcgca cagccagagc ctggaccggc tgatgaatcc 
caatacctgt attacctgaa cagaactcaa aatcagtccg gaagtgccca 
ttgctgttta gccgtgggtc tccagctggc atgtctgttc agcccaaaaa 
ggaccctgtt atcggcagca gcgcgtttct aaaacaaaaa caggcaacaa 
tttacctgga ctggtgcttc aaaatataac ctcaatgggc atgaatccat 
ggcactgcta tggcctcaca caaagacgac gaagacaagt tctttcccat 
atgatttttg gaaaagagag cgccggagct tcaaacactg cattggacaa 
acagacgaag aggaaattaa agccactaac cctgtggcca ccgaaagatt 
gcagtcaatt tccagagcag cagcacacac cctgcgaccg gagatgtgca 
gcattacctg gcatggtgtg gcaagataga gacgtgtacc tgcagggtcc 
aaaattcctc acacagatgg acactttcac ccgtctcctc ttatgggcgg 
aagcacccgc ctcctcagat cctcatcaaa aacacgcctg ttcctgcgaa 
gagttttcag ctacaaagtt tgcttcattc atcacccaat actccacagg 
gtggaaattg aatgggagct gcagaaagaa aacagcaagc gctggaatcc 
tacacatcca attatgcaaa atctgccaac gttgatttta ccgtggacaa 
tatactgagc ctcgccccat tggcacccgt taccttaccc gtcccctgta 

<210> 13 

<211> 2202 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Note = 
synthethic construct 



cgccgacgga 
cagagtcatc 
caagcaaatc 
caccccctgg 
gcagcgactc 
caacatccaa 
taccagcacg 
tgcgcaccag 
ctacctgacg 
atatttccct 
ggaagtgcct 
tctcatcgac 
aaacaaggac 
ctggctacct 
caacagcaat 
catcaaccct 
gagcggtgtc 
tgtcatgatt 
tgggaccgtg 
tgttatggga 
catctgggcc 
ctttggactc 
tcctccggcg 
acaagtgagt 
cgaagtgcag 
caatggactt 



<400> 13 

atgactgacg 

tggtgggcgc 

gctcggggtc 

ggggaacccg 

cagctcaagg 

gagcgtcttc 

aaaaagagga 

aaaaagagac 

tcaggccagc 

gtcccagacc 

acaatggctt 

ggtaattcct 

accagcaccc 

agccaatcag 

tttgacttca 

aacaactggg 

gaggtcacgc 

gtgtttactg 

ctcccgccgt 

aacgggagtc 

atgctgcgta 

agcagctacg 

ctgtactatc 

agccaagctg 

tacagacagc 



gttaccttcc 
tgcaacctgg 
ttgtgcttcc 
tcaacgcagc 
ccggtgacaa 
aagaagatac 
tccttgagcc 
ctgtagagca 
agcccgctag 
ctcaaccact 
caggcggtgg 
caggaaattg 
gcacctgggc 
gagccaccaa 
acagattcca 
gattccgacc 
agaatgacgg 
actccgagta 
tcccagcaga 
aggcggtagg 
ctggaaacaa 
ctcacagcca 
tgaacaggac 
gacccaccag 
agcgtctgtc 



agattggcta 
agcccctaaa 
gggttacaaa 
ggacgcggca 
cccctacctc 
gtcttttggg 
tctgggtctg 
atctccagca 
aaaaagactg 
cggacaacct 
cgcaccaatg 
gcattgcgat 
cctgcccacc 
cgacaaccac 
ctgccacttt 
caagagactc 
tacgacgacg 
ccagctcccg 
cgtcttcatg 
acgctcttcc 
ctttcagttt 
aagtctggac 
acaaacagcc 
tatgtctctt 
aaagcaggca 



gaggacaacc 
cccaaggcaa 
tacctcggac 
gccctcgaac 
aagtacaacc 
ggcaacctcg 
gttgaggaag 
gaaccggact 
aattttggtc 
cccgcagccc 
gcagacaata 
tcccaatggc 
tacaacaatc 
tactttggct 
tcaccacgtg 
aacttcaagc 
attgccaata 
tacgtcctcg 
gtcccacagt 
ttttactgcc 
agctacactt 
cgtctcatga 
agtggaactc 
caagctaaaa 
aacgacaaca 
Page 



tctctgaagg 
atcaacaaca 
ccggcaacgg 
acgacaaggc 
acgccgacgc 
gacgagcagt 
cggctaagac 
cctcttcggg 
agactggcga 
cctctggtgt 
acgagggtgc 
tgggcgacag 
acctctacaa 
acagcacccc 
actggcaaag 
tctttaacat 
accttaccag 
gctcggcgca 
atggatacct 
tggagtactt 
ttgaagacgt 
atcctctgat 
agcagtctcg 
actggctgcc 
acaacagcaa 
14 



cgttcgagag 
tcaggacaac 
acttgacaag 
ctacgaccag 
cgagtttcag 
cttccaggcc 
ggctcctgga 
catcggcaaa 
cacagagtca 
gggatctact 
cgatggagtg 
agtcatcacc 
gcaaatctcc 
ctgggggtat 
actcatcaac 
tcaagtcaaa 
cacggttcag 
tcagggatgc 
caccctgaac 
tccttctcag 
gcctttccac 
cgaccagtac 
gctactgttt 
tggaccttgc 
ctttccctgg 
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actggtgcca ccaaatatca tctgaatggc cgggactcat tggtgaaccc 
atggccagtc acaaggatga caaagaaaag tttttcccca tgcatggaac 
ggtaaagaag gaacaaatgc caacaacgcg gatttggaaa atgtcatgat 
gaagaaatcc gcaccaccaa tcccgtggct acggagcagt acgggactgt 
ttgcaaaact caaacgctgg tccaactact ggaactgtca atcaccaagg 
ggtatggtgt ggcaggatcg agacgtgtac ctgcagggac ccatttgggc 
cacaccgatg gacactttca tccttctcca ctgatgggag gttttgggct 
cctcctcaga tcatgatcaa aaacactccc gttccagcca atcctcccac 
gcggcaaagt ttgcttcctt catcacacag tactccacgg ggcaggtcag 
gagtgggagc tgcagaagga gaacagcaaa cgctggaatc ccgaaattca 
aactacaaca aatctgttaa tgtggacttt actgtggaca ctaatggtgt 
cctcgcccca ttggcaccag atacctgact cgtaatctgt aa 

<210> 14 
<211> 2211 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Note = 
synthethic construct 

<400> 14 

atggctgccg atggttatct tccagattgg ctcgaggaca 
gagtggtggg acttgaaacc tggagccccg aaacccaaag 
gacggccggg gtctggtgct tcctggctac aagtacctcg 
aagggggagc ccgtcaacgc ggcggacgca gcggccctcg 
cagcagctca aagcgggtga caatccgtac ctgcggtata 
caggagcgtc tgcaagaaga tacgtctttt gggggcaacc 
gccaagaagc gggttctcga accttttggt ctggttgagg 
ggaaagaaac gtccggtaga gcagtcgcca caagagccag 
aagacaggcc agcagcccgc taaaaagaga ctcaattttg 
tcagtccccg acccacaacc tctcggagaa cctccagcaa 
actacaatgg cttcaggcgg tggcgcacca atggcagaca 
gtgggtaatg cctcaggaaa ttggcattgc gattccacat 
accaccagca cccgcacctg ggccttgccc acctacaata 
tccagtgctt caacgggggc cagcaacgac aaccactact 
gggtattttg atttcaacag attccactgc cacttttcac 
atcaacaaca attggggatt ccggcccaag agactcaact 
gtcaaggagg tcacgacgaa tgatggcgtc acaaccatcg 
gttcaagtct tctcggactc ggagtaccag cttccgtacg 
ggctgcctcc ctccgttccc ggcggacgtg ttcatgattc 
ctcaacaatg gcagccaagc cgtgggacgt tcatcctttt 
tctcagatgc tgagaacggg caacaacttt accttcagct 
ttccacagca gctacgcgca cagccagagc ctggaccggc 
caatacctgt attacctgaa cagaactcaa aatcagtccg 
ttgctgttta gccgtgggtc tccagctggc atgtctgttc 
ggaccctgtt atcggcagca gcgcgtttct aaaacaaaaa 
tttacctgga ctggtgcttc aaaatataac ctcaatgggc 
ggcactgcta tggcctcaca caaagacgac gaagacaagt 
atgatttttg gaaaagagag cgccggagct tcaaacactg 
acagacgaag aggaaattaa agccactaac cctgtggcca 
gcagtcaatt tccagagcag cagcacagac cctgcgaccg 
gcattacctg gcatggtgtg gcaagataga gacgtgtacc 
aaaattcctc acacagatgg acactttcac ccgtctcctc 
aagcacccgc ctcctcagat cctcatcaaa aacacgcctg 
gagttttcag ctacaaagtt tgcttcattc atcacccaat 
gtggaaattg aatgggagct gcagaaagaa aacagcaagc 
tacacatcca attatgcaaa atctgccaac gttgatttta 
tatactgagc ctcgccccat tggcacccgt taccttaccc 



gggccctgct 
cctgatattt 
tacagatgaa 
gtcaaataat 
agcgttacct 
caagattcct 
caaacacccg 
aaactttagt 
cgtggagatc 
gtacacttcc 
gtattcagag 



acctctctga 

ccaaccagca 

gacccttcaa 

agcacgacaa 

accacgccga 

tcgggcgagc 

aaggcgctaa 

actcctcctc 

gtcagactgg 

cccccgctgc 

ataacgaagg 

ggctgggcga 

accacctcta 

tcggctacag 

cacgtgactg 

tcaagctctt 

ctaataacct 

tcctcggctc 

cgcaatacgg 

actgcctgga 

acacctttga 

tgatgaatcc 

gaagtgccca 

agcccaaaaa 

cagacaacaa 

gtgaatccat 

tctttcccat 

cattggacaa 

ccgaaagatt 

gagatgtgca 

tgcagggtcc 

ttatgggcgg 

ttcctgcgaa 

actccacagg 

gctggaatcc 

ccgtggacaa 

gtcccctgta 



gggcattcgc 
aaagcaggac 
cggactcgac 
ggcctacgac 
cgccgagttt 
agtcttccag 
gacggctcct 
gggcatcggc 
cgactcagag 
tgtgggacct 
cgccgacgga 
cagagtcatc 
caagcaaatc 
caccccctgg 
gcagcgactc 
caacatccaa 
taccagcacg 
tgcgcaccag 
ctacctgacg 
atatttccct 
ggaagtgcct 
tctcatcgac 
aaacaaggac 
ctggctacct 
caacagcaat 
catcaaccct 
gagcggtgtc 
tgtcatgatt 
tgggaccgtg 
tgttatggga 
catctgggcc 
ctttggactc 
tcctccggcg 
acaagtgagt 
cgaagtgcag 
caatggactt 
a 
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<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Note = 
synthethic construct 

<400> 15 

atggctgccg atggttatct tccagattgg ctcgaggaca acctctctga 
aagtggtggg acttgaaacc tggagccccg aaacccaaag ccaaccagca 
gacggccggg gtctggtgct tcctggctac aagtacctcg gacccttcaa 
aagggggagc ccgtcaacgc ggcggacgca gcggccctcg agcacgacaa 
cagcagctca aagcgggtga caatccgtac ctgcggtata accacgccga 
caggagcgtc tgcaagaaga tacgtctttt gggggcaacc tcgggagagc 
gccaagaagc gggttctcga accttttggt ctggttgagg aaggcgctaa 
ggaaagaaac gtccggtaga gcagtcgcca caagagccag actcctcctc 
aagacaggcc agcagcccgc taaaaagaga ctcaattttg gtcagactgg 
tcagtccccg acccacaacc tctcggagaa cctccagcaa cccccgctgc 
actacaatgg cttcaggcgg tggcgcacca atggcagaca ataacgaagg 
gtgggtaatg cctcaggaaa ttggcattgc gattccacat ggctgggcga 
accaccagca cccgcacctg ggccttgccc acctacaata accacctcta 
tccagtgctt caacgggggc cagcaacgac aaccactact tcggctacag 
gggtattttg atttcaacag attccactgc cacttttcac cacgtgactg 
atcaacaaca attggggatt ccggcccaag agactcaact tcaagctctt 
gtcaaggagg tcacgacgaa tgatggcgtc acaaccatcg ctaataacct 
gttcaagtct tctcggactc ggagtaccag cttccgtacg tcctcggctc 
ggctgcctcc ctccgttccc ggcggacgtg ttcatgattc cgcaatacgg 
ctcaacaatg gcagccaagc cgtgggacgt tcatcctttt actgcctgga 
tctcagatgc tgagaacggg caacaacttt accttcagct acacctttga 
ttccacagca gctacgcgca cagccagagc ctggaccggc tgatgaatcc 
caatacctgt attacctgaa cagaactcaa aatcagtccg gaagtgccca 
ttgctgttta gccgtgggtc tccagctggc atgtctgttc agcccaaaaa 
ggaccctgtt atcggcagca gcgcgtttct aaaacaaaaa cagacaacaa 
tttacctgga ctggtgcttc aaaatataac ctcaatgggc gtgaatccat 
ggcactgcta tggcctcaca caaagacgac aaagacaagt tctttcccat 
atgatttttg gaaaagagag cgccggagct tcaaacactg cattggacaa 
acagacgaag aggaaattaa agccactaac cctgtggcca ccgaaagatt 
gcagtcaatt tccagagcag cagcacagac cctgcgaccg gagatgtgca 
gcattacctg gcatggtgtg gcaagataga gacgtgtacc tgcagggtcc 
aaaattcctc acacagatgg acactttcac ccgtctcctc ttatgggcgg 
aagcacccgc ctcctcagat cctcatcaaa aacacacctg ttcctgcgaa 
gagttttcag ctacaaagtt tgcttcattc atcacccaat actccacagg 
gtggaaattg aatgggagct gcagaaagaa aacagcaagc gctggaatcc 
tacacatcca attatgcaaa atctgccaac gttgatttta ccgtggacaa 
tatactgagc ctcgccccat tggcacccgt taccttaccc gtcccctgta 

<210> 16 
<211> 2211 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial sequence: Note = 
synthethic construct 



gggcattcgc 
aaagcaggac 
cggactcgac 
ggcctacgac 
cgccgagttt 
agtcttccag 
gacggctcct 
gggcatcggc 
cgactcagag 
tgtgggacct 
cgccgacgga 
cagagtcatc 
caagcaaatc 
caccccctgg 
gcagcgactc 
caacatccaa 
taccagcacg 
tgcgcaccag 
ctacctgacg 
atatttccct 
ggaagtgcct 
tctcatcgac 
aaacaaggac 
ctggctacct 
caacagcaat 
catcaaccct 
gagcggtgtc 
tgtcatgatt 
tgggaccgtg 
tgttatggga 
catttgggcc 
ctttggactc 
tcctccggcg 
acaagtgagt 
cgaagtgcag 
caatggactt 



<400> 16 

atggctgccg 

gagtggtggg 

aacggccggg 

aagggggagc 

cagcagctca 

caggagcgtc 

gccaagaagc 



atggttatct 
acttgaaacc 
gtctggtgct 
ccgtcaacgc 
aagcgggtga 
tgcaagaaga 
gggttctcga 



tccagattgg 
tggagccccg 
tcctggctac 
ggcggacgca 
caatccgtac 
tacgtctttt 
accttttggt 



ctcgaggaca 
aaacccaaag 
aagtacctcg 
gcggccctcg 
ctgcggtata 
gggggcaacc 
ctggttgagg 
Page 



acctctctga 
ccaaccagca 
gacccttcaa 
agcacgacaa 
accacgccga 
tcgggcgagc 
aaggcgctaa 
16 



gggcattcgc 
aaagcaggac 
cggactcgac 
ggcctacgac 
cgccgagttt 
agtcttccag 
gacggctcct 
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ggaaagaaac gtccggtaga gcagtcgcca caagagccag actcctcctc gggcatcggc 480 

aagacaggcc agcagcccgc taaaaagaga ctcaattttg gtcagactgg cgactcagag 540 

tcagtccccg acccacaacc tctcggagaa cctccagcaa cccccgctgc tgtgggacct 600 

actacaatgg cttcaggcgg tggcgcacca atggcagaca ataacgaagg cgccgacgga 660 

gtgggtaatg cctcaggaaa ttggcattgc gattccacat ggctgggcga cagagtcatc 720 

accaccagca cccgcacctg ggccttgccc acctacaata accacctcta caagcaaatc 780 

tccagtgctt caacgggggc cagcaacgac aaccactact tcggctacag caccccctgg 840 

gggtattttg atttcaacag attccactgc cacttttcac cacgtgactg gcagcgactc 900 

atcaacaaca attggggatt ccggcccaag agactcaact tcaagctctt caacatccaa 960 

gtcaaggagg tcacgacgaa tgatggcgtc acaaccatcg ctaataacct taccagcacg 1020 

gttcaagtct tctcggactc ggagtaccag cttccgtacg tcctcggctc tgcgcaccag 1080 

ggctgcctcc ctccgttccc ggcggacgtg ttcatgattc cgcaatacgg ctacctgacg 1140 

ctcaacaatg gcagccaagc cgtgggacgt tcatcctttt actgcctgga atatttccct 1200 

tctcagatgc tgagaacggg caacaacttt accttcagct acacctttga ggaagtgcct 1260 

ttccacagca gctacgcgca cagccagagc ctggaccggc tgatgaatcc tctcatcgac 1320 

caatacctgt attacctgaa cagaactcaa aatcagtccg gaagtgccca aaacaaggac 1380 

ttgctgttta gccgtgggtc tccagctggc atgtctgttc agcccaaaaa ctggctacct 1440 

ggaccctgtt atcggcagca gcgcgtttct aaaacaaaaa cagacaacaa caacagcaat 1500 

tttacctgga ctggtgcttc aaaatataac ctcaatgggc gtgaatccat catcaaccct 1560 

ggcactgcta tggcctcaca caaagacgac gaagacaagt tctttcccat gagcggtgtc 1620 

atgatttttg gaaaagagag cgccggagct tcaaacactg cattggacaa tgtcatgatt 1680 

acagacgaag aggaaattaa agccactaac cctgtggcca ccgaaagatt tgggaccgtg 1740 

gcagtcaatt tccagagcag cagcacacac cctgcgaccg gagatgtgca tgttatggga 1800 

gcattacctg gcatggtgtg gcaagataga gacgtgtacc tgcagggtcc catctgggcc 1860 

aaaattcctc acacagatgg acactttcac ccgtctcctc ttatgggcgg ctttggactc 1920 

aagcacccgc ctcctcagat cctcatcaaa aacacgcctg ttcctgcgaa tcctccggcg 1980 

gagttttcag ctacaaagtt tgcttcattc atcacccaat actccacagg acaagtgagt 2040 

gtggaaattg aatgggagct gcagaaagaa aacagcaagc gctggaatcc cgaagtgcag 2100 

tacacatcca attatgcaaa atctgccaac gttgatttta ccgtggacaa caatggactt 2160 

tatactgagc ctcgccccat tggcacccgt taccttaccc gtcccctgta a 2211 

<210> 17 
<211> 2211 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Description of Artificial Sequence: Note = 
synthethic construct 

<400> 17 

atggctgccg atggttatct tccagattgg ctcgaggaca acctctctga gggcattcgc 60 

gagtggtggg acttgaaacc tggagccccg aaacccaaag ccaaccagca aaagcaggac 120 

gacggccggg gtctggtgct tcctggctac aagtacctcg gacccttcaa cggactcgac 180 

aagggggagc ccgtcaacgc ggcggacgca gcggccctcg agcacgacaa ggcctacgac 240 

cagcagctca aagcgggtga caatccgtac ctgcggtata accacgccga cgccgagttt 300 

caggagcgtc tgcaagaaga tacgtctttt gggggcaacc tcgggcgagc agtcttccag 360 

gccaaaaagc gggttctcga accttttggt ctggttgagg aaggcgctaa gacggctcct 420 

ggaaagaaac gtccggtaga gcagtcgcca caagagccag actcctcctc gggcatcggc 480 

aagacaggcc agcagcccgc taaaaagaga ctcaattttg gtcagactgg cgactcagag 540 

tcagtccccg acccacaacc tctcggagaa cctccagcaa cccccgctgc tgtgggacct 600 

actacaatgg cttcaggcgg tggcgcacca atggcagaca ataacgaagg cgccgacgga 660 

gtgggtaatg cctcaggaaa ttggcattgc gattccacat ggctgggcga cagagtcatc 720 

accaccagca cccgcacctg ggccttgccc acctacaata accacctcta caagcaaatc 780 

tccagtgctt caacgggggc cagcaacgac aaccactact. tcggctacag caccccctgg 840 

gggtattttg atttcaacag attccactgc cacttttcac cacgtgactg gcagcgactc 900 

atcaacaaca attggggatt ccggcccaag agactcaact tcaagctctt caacatccaa 960 

gtcaaggagg tcacgacgaa tgatggcgtc acaaccatcg ctaataacct taccagcacg 1020 

gttcaagtct tctcggactc ggagtaccag cttccgtacg tcctcggctc tgcgcaccag 1080 

ggctgcctcc ctccgttccc ggcggacgtg ttcatgattc cgcaatacgg ctacctgacg 1140 

ctcaacaatg gcagccaagc cgtgggacgt tcatcctttt attgcctgga atatttccca 1200 

tcgcagatgc tgagaacggg caataacttt accttcagct acacctttga ggacgtgcct 1260 

ttccacagca gctacgcgca cagccagagc ctggaccggc tgatgaatcc tctcatcgac 1320 
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cagtacctgt 
ttgctgttta 
ggaccctgtt 
tttacctgga 
ggcactgcta 
atgatttttg 
acagacgaag 
gcagtcaatt 
gcattacctg 
aaaattcctc 
aaacacccgc 
gagttttcag 
gtggaaattg 
tacacatcca 
tatactgagc 



attacctgaa 
gccgggggtc 
atcggcagca 
ctggtgcttc 
tggcctcaca 
gaaaagagag 
aggaaattaa 
tccagagcag 
gcatggtgtg 
acacagatgg 
ctcctcagat 
ctacaaagtt 
aatgggagct 
attatgcaaa 
ctcgccccat 



cagaactcag 
tccagctggc 
gcgcgtttct 
aaaatataac 
caaagacgac 
cgccggagct 
agccactaac 
cagcacagac 
gcaagataga 
acactttcac 
cctcatcaaa 
tgcttcattc 
gcagaaagaa 
atctgccaac 
tggcacccgt 



140140433Pl.TXT 
aaccagtccg gaagtgccca* 
atgtctgttc agctcaaaaa 
aaaacaaaaa cagacaacaa 
cttaatgggc gtgaatccat 
gaagacaagt tctttcccat 
tcaaacactg cattggacaa 
cctgtggcca ccgaaagatt 
cctgcgaccg gagatgtgca 
gacgtgtacc tgcagggtcc 
ccgtctcctc ttatgggcgg 
aacacacctg ttcctgcgaa 
atcacccaat actccacagg 
aacagcaagc gctggaatcc 
gttgatttta ccgtggacaa 
taccttaccc gtcccctgta 



aaacaaggac 
ctggctacct 
caacagcaat 
catcaaccct 
gagcggtgtc 
tgtcatgatt 
tgggaccgtg 
tgttatggga 
catttgggcc 
ctttggactc 
tcctccggcg 
acaagtgagc 
cgaagtgcag 
caatggactt 
a 



<210> 18 
<211> 2211 
<212> DNA 
<213> Artificial 



Sequence 



<220> 

<223> Description of Artificial sequence: 
synthethic construct 



Note = 



<400> 18 

atggctgccg 

gagtggtggg 

gacggccggg 

aagggggagc 

cagcagctca 

caggagcgtc 

gccaagaagc 

ggaaagaaga 

aaatcaggcc 

tcagtccccg 

actacaatgg 

gtgggtaatg 

accaccagca 

tccagcgctt 

gggtattttg 

atcaacaaca 

gtcaaggagg 

gttcaagtct 

ggctgcctcc 

ctcaacaatg 

tcgcagatgc 

ttccacagca 

cagtacttgt 

ttgctgttta 

ggaccctgtt 

tttacctgga 

ggcactgcta 

atgatttttg 

acagacgaag 

gcagtcaatt 

gcattacctg 

aaaattcctc 

aagcacccgc 

gagttttcgg 

gtggaaattg 

tacacatcta 

tatactgagc 



atggttatct 
acttgaaacc 
gtctggtgct 
ccgtcaacgc 
aagcgggtga 
tgcaagaaga 
gggttctcga 
gaccggtaga 
agcagcccgc 
acccacaacc 
cttcaggcgg 
cctcaggaaa 
cccgaacctg 
caacgggggc 
attttaacag 
attggggatt 
tcacgacgaa 
tctcggactc 
ctccgttccc 
gcagccaggc 
tgagaacggg 
gctacgcgca 
attacctaaa 
gccgggggtc 
atcggcagca 
ctggtgcttc 
tggcttcaca 
gcaaggagag 
aggaaattaa 
tccagagcag 
gcatggtgtg 
acacagatgg 
ctcctcagat 
ctacaaagtt 
aatgggagtt 
attatgcaaa 
ctcgccccat 



tccagattgg 
tggagccccg 
tcctggctac 
ggcggacgca 
caatccgtac 
tacgtctttt 
accttttggt 
gcagtcgccc 
taaaaagaga 
tctcggagaa 
tggcgcacca 
ttggcattgc 
ggccctgccc 
cagcaacgac 
attccactgc 
ccggcccaag 
cgatggcgtc 
ggagtaccag 
ggcggacgtg 
cgtgggacgt 
caataacttt 
cagccaaagc 
cagaactcaa 
tccagctggc 
gcgcgtttct 
aaaatataac 
caaagacgac 
cgccggagct 
agccactaac 
cagcacagac 
gcaagataga 
acactttcac 
cctcatcaaa 
tgcttcattc 
gcagaaagaa 
atctgccaac 
tggcacccgt 



ctcgaggaca 
aaacccaaag 
aagtacctcg 
gcggccctcg 
ctgcggtata 

gggggcaacc 
ctggttgagg 
caagaaccag 
ctcaattttg 
cctccagcaa 
atggcagaca 
gattccacat 
acctataaca 
aaccactact 
cacttctcac 
agactcaact 
acgaccatcg 
ttgccgtacg 
ttcatgattc 
tcatcctttt 
accttcagct 
ctggaccggc 
aatcagtccg 
atgtctgttc 
aaaacaaaaa 
cttaatgggc 
gaagacaagt 
tcaaacactg 
cctgtggcca 
cctgcgaccg 
gacgtgtacc 
ccgtctcctc 
aacacgcctg 
atcacccagt 
aacagcaagc 
gttgatttca 
tacctcaccc 
Page 



acctctctga 
ccaaccagca 
gacccttcaa 
agcacgacaa 
accacgccga 
tcgggcgagc 
aaggtgctaa 
actcctcatc 
gtcagactgg 
cccccgctgc 
ataacgaagg 
ggctgggcga 
accacctcta 
tcggctacag 
cacgtgactg 
tcaagctctt 
ctaataacct 
tcctcggctc 
cgcagtacgg 
actgcctgga 
acacattcga 
tgatgaatcc 
gaagtgccca 
agcccaaaaa 
cagacaacaa 
gtgaatctat 
tctttccaat 
cattggacaa 
ccgaaagatt 
gagatgtgca 
tgcagggtcc 
ttatgggcgg 
ttcctgcgaa 
attccacagg 
gttggaatcc 
ctgtggacaa 
gtcccctgta 
18 



gggcattcgc 
aaagcaggac 
cggactcgac 
ggcctacgac 
cgccgagttt 
agtcttccag 
gacggctcct 
gggcatcggc 
cgactcagag 
tgtgggacct 
cgccgacgga 
cagagtcatt 
caaacaaatc 
caccccctgg 
gcagcgactc 
caacatccaa 
taccagcacg 
tgcgcaccag 
ctacctaaca 
atatttccca 
ggacgtgcct 
tctcatcgac 
aaacaaggac 
ctggctacct 
caacagcaac 
aatcaaccct 
gagcggtgtc 
tgtcatgatt 
tgggaccgtg 
tgttatggga 
aatttgggcc 
ctttggactt 
tcctccggca 
acaagtgagt 
cgaagtgcag 
caatggactt 
a 
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<210> 19 
<211> 2211 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Description of Artificial Sequence: Note 
synthethic construct 



<400> 19 

atggctgccg 

gagtggtggg 

gacggccggg 

aagggggagc 

cagcagctca 

caggagcgtc 

gccaagaaga 

ggaaagaaac 

aaatcaggcc 

tcagtccccg 

actacaatgg 

gtgggtaatg 

accaccagca 

tccagtgctt 

gggtattttg 

atcaacaaca 

gtcaaggagg 

gttcaagtct 

ggctgcctcc 

ctcaacaatg 

tctcagatgc 

ttccacagca 

caatacctgt 

ttgctgttta 

ggaccctgtt 

tttacctgga 

ggcactgcta 

atgatttttg 

acagacgaag 

gcagtcaatt 

gcattacctg 

aaaattcctc 

aagcacccgc 

gagttttcgg 

gtggaaattg 

tacacatcta 

tatactgagc 



atggttatct 
acttgaaacc 
gtctggtgct 
ccgtcaacgc 
aagcgggtga 
tgcaagaaga 
gggttctcga 
gtccggtaga 
agcagcccgc 
acccacaacc 
cttcaggcgg 
cctcaggaaa 
cccgcacctg 
caacgggggc 
atttcaacag 
actggggatt 
tcacgacgaa 
tctcggactc 
ctccgttccc 
gcagcaaagc 
tgagaacggg 
gctacgcgca 
attacctgaa 
gccgtgggtc 
atcggcagca 
ctggtgcttc 
tggcctcaca 
gaaaagagag 
aggaaattaa 
tccagagcag 
gcatggtgtg 
acacggatgg 
ctcctcagat 
ctacaaagtt 
aatgggagtt 
attatgcaaa 
ctcgccccat 



tccagattgg 
tggagccccg 
tcctggctac 
ggcggacgca 
caatccgtac 
tacgtctttt 
accttttggt 
gcagtcaccc 
taaaaagaga 
tctcggagaa 
tggcgcacca 
ttggcattgc 
ggccttgccc 
cagcaacgac 
attccactgc 
ccggcccaag 
cgatggcgtc 
ggagtaccag 
ggcggacgtg 
cgtgggacgt 
caacaacttt 
cagccagagc 
cagaactcaa 
tccagctggc 
gcgcgtttct 
aaaatataac 
caaagacgac 
cgccggagct 
agccactaac 
cagcacagac 
gcaagataga 
acactttcac 
cctcatcaaa 
tgcttcattc 
gcagaaagaa 
atctgccaac 
tggcacccgt 



ctcgaggaca 
aaacccaaag 
aagtacctcg 
gcggccctcg 
ctgcggtata 
gggggcaacc 
ctggttgagg 
caagaaccag 
ctcaattttg 
cctccagcaa 
atggcagaca 
gattccacat 
acctacaata 
aaccactact 
catttctcac 
agactcaact 
acgaccatcg 
cttccgtacg 
ttcatgattc 
tcatcctttt 
accttcagct 
ctggaccggc 
aatcagtccg 
atgtctgttc 
aaaacaaaaa 
ctcaatgggc 
gaagacaagt 
tcaaacactg 
cctgtggcca 
cctgcgaccg 
gacgtgtacc 
ccgtctcctc 
aacacacctg 
atcacccagt 
aacagtaagc 
gttgacttca 
tacctcaccc 



acctctctga 
ccaaccagca 
gacccttcaa 
agcacgacaa 
accacgccga 
tcgggcgagc 
aaggtgctaa 
actcctcctc 
gtcagactgg 
cccccgctgc 
ataacgaagg 
ggctgggcga 
accacctcta 
tcggctacag 
cacgtgactg 
tcaagctctt 
ctaataacct 
tcctcggctc 
cgcagtacgg 
actgcctgga 
acacctttga 
tgatgaatcc 
gaagtgccca 
agcccaaaaa 
cagacaacaa 
gtgaatccat 
tctttcccat 
cattggacaa 
ccgaaagatt 
gagatgtgca 
tgcagggtcc 
ttatgggcgg 
ttcctgcgaa 
actccacagg 
gctggaatcc 
ctgtggacaa 
gtcccctgta 



gggcattcgc 
aaagcaggac 
cggactcgac 
ggcctacgac 
cgccgagttt 
agtcttccag 
gacggctcct 
gggcattggc 
cgactcagag 
tttgggacct 
cgccgacgga 
cagagtcatc 
caagcaaatc 
caccccctgg 
gcagcgactc 
caacatccaa 
taccagcacg 
tgcgcaccag 
ctacctgacg 
atatttccct 
ggaagtgcct 
tctcatcgac 
aaacaaggac 
ctggctacct 
caacagcaat 
catcaaccct 
gagcggtgtc 
tgtcatgatt 
tgggaccgtg 
tgttatggga 
catttgggcc 
ctttggactc 
tcctccggca 
acaagtgagc 
cgaagtgcag 
caatggactt 



<210> 20 
<211> 2970 
<212> DNA 

<213> Artificial sequence 

<223> Description of Artificial Sequence: Note = 
synthethic construct 

<400> 20 

agatgaccgc caaggtcgta gagtccgcca aggccattct gggcggcagc aaggtgcgcg 
tggaccaaaa atgcaaggcc tctgcgcaga tcgaccccac ccccgtgatc gtcacctcca 
acaccaacat gtgcgccgtg attgacggga acagcaccac cttcgagcac cagcagcccc 
tgcaggaccg gatgttcaag tttgaactca cccgccgcct cgactacgac tttggcaagg 
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tcaccaagca ggaagtcaag gactttttcc ggtgggcggc tgatcacgtg actgacgtgg 300 

ctcatgagtt ttacgtcaca aagggtggag ctaagaaaag gcccgccccc tctgacgagg 360 

atataagcga gcccaagcgg ccgcgcgtgt catttgcgca gccggagacg tcagacgcgg 420 

aagctcccgg agacttcgcc gacaggtacc aaaacaaatg ttctcgtcac gcgggtatgc 480 

tgcagatgct ctttccctgc aagacgtgcg agagaatgaa tcagaattcc aacgtctgct 540 

tcacgcacgg tcagaaagat tgcggggagt gctttcccgg gtcagaatct caaccggttt 600 

ctgtcgtcag aaaaacgtat cagaaactgt gcatccttca tcagctccgg ggggcacccg 660 

agatcgcctg ctctgcttgc gaccaactca accccgattt ggacgattgc caatttgagc 720 

aataaatgac tgaaatcagg tatggctgct gacggttatc ttccagattg gctcgaggac 780 

aacctctctg aaggcattcg cgagtggtgg gcgctgaaac ctggagctcc acaacccaag 840 

gccaaccaac agcatcagga caacggcagg ggtcttgtgc ttcctgggta caagtacctc 900 

ggacccttca acggactcga caagggagag ccggtcaacg aggcagacgc cgcggccctc 960 

gagcacgaca aggcctacga caagcagctc gagcaggggg acaacccgta tctcaagtac 1020 

aaccacggcg acgccgagtt ccagcagcgc ttggcgaccg acacctcttt tgggggcaac 1080 

ctcgggcgag cagtcttcca ggccaaaaag aggattctcg agcctctggg tctggttgaa 1140 

gagggcgtta aaacggctcc tggaaagaaa cgcccattag aaaagactcc aaatcggccg 1200 

accaacccgg actctgggaa ggccccggcc aagaaaaagc aaaaagacgg cgaaccagcc 1260 

gactctgcta gaaggacact cgactttgaa gactctggag caggagacgg accccctgag 1320 

ggatcatctt ccggagaaat gtctcatgat gctgagatgc gtgcggcgcc aggcggaaat 1380 

gctgtcgagg cgggacaagg tgccgatgga gtgggtaatg cctccggtga ttggcattgc 1440 

gattccacct ggtcagaggg ccgagtcacc accaccagca cccgaacctg ggtcctaccc 1500 

acgtacaaca accacctgta cctgcgaatc ggaacaacgg ccaacagcaa cacctacaac 1560 

ggattctcca ccccctgggg atactttgac tttaaccgct tccactgcca cttttcccca 1620 

cgcgactggc agcgactcat caacaacaac tggggactca ggccgaaatc gatgcgtgtt 1680 

aaaatcttca acatacaggt caaggaggtc acgacgtcaa acggcgagac tacggtcgct 1740 

aataacctta ccagcacggt tcagatcttt gcggattcga cgtatgaact cccatacgtg 1800 

atggacgccg gtcaggaggg gagctttcct ccgtttccca acgacgtctt tatggttccc 1860 

caatacggat actgcggagt tgtcactgga aaaaaccaga accagacaga cagaaatgcc 1920 

ttttactgcc tggaatactt tccatcccaa atgctaagaa ctggcaacaa ttttgaagtc 1980 

agttaccaat ttgaaaaagt tcctttccat tcaatgtacg cgcacagcca gagcctggac 2040 

agaatgatga atcctttact ggatcagtac ctgtggcatc tgcaatcgac cactaccgga 2100 

aattccctta atcaaggaac agctaccacc acgtacggga* aaattaccac tggagacttt 2160 

gcctactaca ggaaaaactg gttgcctgga gcctgcatta aacaacaaaa attttcaaag 2220 

aatgccaatc aaaactacaa gattcccgcc agcgggggag acgccctttt aaagtatgac 2280 

acgcatacca ctctaaatgg gcgatggagt aacatggctc ctggacctcc aatggcaacc 2340 

gcaggtgccg gggactcgga ttttagcaac agccagctga tctttgccgg acccaatccg 2400 

agcggtaaca cgaccacatc ttcaaacaat ttgttgttta cctcagaaga ggagattgcc 2460 

acaacaaacc cacgagacac ggacatgttt ggacagattg cagataataa tcaaaatgcc 2520 

accaccgccc ctcacatcgc taacctggac gctatgggaa ttgttcccgg aatggtctgg 2580 

caaaacagag acatctacta ccagggccct atttgggcca aggtccctca cacggacgga 2640 

cactttcacc cttcgccgct gatgggagga tttggactga aacacccgcc tccacagatt 2700 

ttcatcaaaa acacccccgt acccgccaat cccaatacta cctttagcgc tgcaaggatt 2760 

aattcttttc tgacgcagta cagcaccgga caagttgccg ttcagatcga ctgggaaatt 2820 

cagaaggagc attccaaacg ctggaatccc gaagttcaat ttacttcaaa ctacggcact 2880 

caaaattcta tgctgtgggc tcccgacaat gctggcaact accacgaact ccgggctatt 2940 

gggtcccgtt tcctcaccca ccacttgtaa 2970 

<210> 21 
<211> 736 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Note = 
synthethic construct 

<400> 21 

Met Ala Ala Asp Gly Tyr Leu Pro Asp Trp Leu Glu Asp Asn Leu Ser 
15 10 15 

Glu Gly lie Arg Glu Trp Trp Asp Leu Lys Pro Gly Ala Pro Lys Pro 

20 25 30 

Lys Ala Asn Gin Gin Lys Gin Asp Asp Gly Arg Gly Leu Val Leu Pro 
35 40 45 
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Gly Tyr Lys Tyr Leu Gly Pro Phe Asn Gly Leu Asp Lys Gly Glu Pro 

50 55 60 

val Asn Ala Ala Asp Ala Ala Ala Leu Glu His Asp Lys Ala Tyr Asp 
65 70 75 . 80 

Gin Gin Leu Lys Ala Gly Asp Asn Pro Tyr Leu Arg Tyr Asn His Ala 

85 90 95 

Asp Ala Glu Phe Gin Glu Arg Leu Gin Glu Asp Thr Ser Phe Gly Gly 

100 105 110 

Asn Leu Gly Arg Ala Val Phe Gin Ala Lys Lys Arg Val Leu Glu Pro 

115 120 , 125 

Phe Gly Leu Val Glu Glu Gly Ala Lys Thr Ala Pro Gly Lys Lys Arg 

130 135 140 

Pro Val Glu Gin Ser Pro Gin Glu Pro Asp Ser Ser Ser Gly lie Gly 
145 150 155 160 

Lys Thr Gly Gin Gin Pro Ala Lys Lys Arg Leu Asn Phe Gly Gin Thr 

165 170 175 

Gly Asp ser Glu Ser Val Pro Asp Pro Gin Pro Leu Gly Glu Pro Pro 

180 185 190 

Ala Thr Pro Ala Ala Val Gly Pro Thr Thr Met Ala Ser Gly Gly Gly 

195 200 205 

Ala Pro Met Ala Asp Asn Asn Glu Gly Ala Asp Gly Val Gly Asn Ala 

210 215 220 

ser Gly Asn Trp His cys Asp Ser Thr Trp Leu Gly Asp Arg val lie 
225 230 235 240 

Thr Thr ser Thr Arg Thr Trp Ala Leu Pro Thr Tyr Asn Asn His Leu 

245 250 255 

Tyr Lys Gin lie Ser Ser Ala Ser Thr Gly Ala Ser Asn Asp Asn His 

260 265 270 

Tyr Phe Gly Tyr Ser Thr Pro Trp Gly Tyr Phe Asp Phe Asn Arg Phe 

275 280 285 

His Cys His Phe Ser Pro Arg Asp Trp Gin Arg Leu lie Asn Asn Asn 

290 295 300 

Trp Gly Phe Arg Pro Lys Arg Leu Asn Phe Lys Leu Phe Asn lie Gin 
305 310 315 320 

val Lys Glu val Thr Thr Ser Asp Gly Val Thr Thr He Ala Asn Asn 

325 330 335 

Leu Thr Ser Thr val Gin Val Phe Ser Asp ser Glu Tyr Gin Leu pro 

340 345 350 

Tyr val Leu Gly Ser Ala His Gin Gly Cys Leu Pro Pro Phe pro Ala 

355 360 365 

asp val Phe Met lie Pro Gin Tyr Gly Tyr Leu Thr Leu Asn Asn Gly 

370 375 380 

ser Gin Ala Val Gly Arg Ser ser Phe Tyr cys Leu Glu Tyr Phe pro 
385 390 395 400 

Ser Gin Met Leu Arg Thr Gly Asn Asn Phe Thr Phe ser Tyr Thr Phe 

405 410 415 

Glu Glu val Pro Phe His Ser Ser Tyr Ala His ser Gin ser Leu Asp 

420 425 430 

Arg Leu Met Asn Pro Leu He Asp Gin Tyr Leu Tyr Tyr Leu Asn Arg 

435 440 445 

Thr Gin Asn Gin ser Gly Ser Ala Gin Asn Lys Asp Leu Leu Phe Ser 

450 455 460 

Arq Gly ser Pro Ala Gly Met ser val Gin Pro Lys Asn Trp Leu Pro 
465 470 475 480 

Gly Pro Cys Tyr Arg Gin Gin Arg val Ser Lys Thr Lys Thr Gly Asn 

485 " 490 495 

Asn Asn ser Asn Phe Thr Trp Thr Gly Ala Ser Lys Tyr Asn Leu Asn 

500 505 n , 510 . 

Gly His Glu ser lie lie Asn Pro Gly Thr Ala Met Ala Ser His Lys 

515 520 525 

Asp Asp Glu Asp Lys Phe Phe Pro Met ser Gly val Met He Phe Gly 

530 535 540 

Lys Glu Ser Ala Gly Ala Ser Asn Thr Ala Leu Asp Asn val Met He 
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545 550 555 560 

Thr Asp Glu Glu Glu lie Lys Ala Thr Asn Pro Val Ala Thr Glu Arg 

565 570 . 575 

Phe Gly Thr Val Ala Val Asn Phe Gin ser ser Ser Thr His Pro Ala 

580 585 590 

Thr Gly Asp val His val Met Gly Ala Leu Pro Gly Met val Trp Gin 

595 600 605 

Asp Arg Asp Val Tyr Leu Gin Gly Pro lie Trp Ala Lys lie Pro His 

610 615 620 

Thr Asp Gly His Phe His Pro Ser Pro Leu Met Gly Gly Phe Gly Leu 
625 630 635 640 

Lys His Pro Pro Pro Gin lie Leu lie Lys Asn Thr Pro Val Pro Ala 

645 650 655 

Asn Pro Pro Ala Glu Phe ser Ala Thr Lys Phe Ala Ser Phe lie Thr 

660 665 670 

Gin Tyr Ser Thr Gly Gin Val Ser val Glu lie Glu Trp Glu Leu Gin 

675 680 685 

Lys Glu Asn Ser Lys Arg Trp Asn Pro Glu val Gin Tyr Thr ser Asn 

690 695 700 

Tyr Ala Lys Ser Ala Asn Val Asp Phe Thr val Asp Asn Asn Gly Leu 
705 710 715 720 

Tyr Thr Glu Pro Arg Pro lie Gly Thr Arg Tyr Leu Thr Arg Pro Leu 
725 730 735 

<210> 22 
<211> 736 
<212> PRT 

<213> Artificial sequence 
<220> 

<223> Description of Artificial Sequence: Note = 
synthethic construct 

<400> 22 

Met Ala Ala Asp Gly Tyr Leu Pro Asp Trp Leu Glu Asp Asn Leu Ser 

15 10 15 

Glu Gly lie Arg Glu Trp Trp Asp Leu Lys Pro Gly Ala pro Lys Pro 

20 25 30 

Lys Ala Asn Gin Gin Lys Gin Asp Asp Gly Arg Gly Leu val Leu Pro 

35 40 45 

Gly Tyr Lys Tyr Leu Gly Pro Phe Asn Gly Leu Asp Lys Gly Glu Pro 

50 55 60 

val Asn Ala Ala Asp Ala Ala Ala Leu Glu His Asp Lys Ala Tyr Asp 
65 70 75 . 80 

Gin Gin Leu Lys Ala Gly Asp Asn Pro Tyr Leu Arg Tyr Asn His Ala 

85 90 95 

Asp Ala Glu Phe Gin Glu Arg Leu Gin Glu Asp Thr Ser Phe Gly Gly 

K 100 105 HO 

Asn Leu Gly Arg Ala val Phe Gin Ala Lys Lys Arg val Leu Glu Pro 

115 120 125 

Phe Gly Leu val Glu Glu Gly Ala Lys Thr Ala Pro Gly Lys Lys Arg 

130 135 140 

pro val Glu Gin Ser Pro Gin Glu Pro Asp Ser Ser Ser Gly He Gly 
145 150 155 160 

Lys Thr Gly Gin Gin Pro Ala Lys Lys Arg Leu Asn Phe Gly Gin Thr 

165 170 , 175 

Gly Asp ser Glu ser Val Pro Asp Pro Gin Pro Leu Gly Glu Pro Pro 

180 185 190 

Ala Thr Pro Ala Ala val Gly Pro Thr Thr Met Ala ser Gly Gly Gly 

195 200 205 

Ala Pro Met Ala Asp Asn Asn Glu Gly Ala Asp Gly val Gly Asn Ala 

210 215 220 , , 

Ser Gly Asn Trp His cys Asp Ser Thr Trp Leu Gly Asp Arg val He 
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225 230 235 240 

Thr Thr ser Thr Arg Thr Trp Ala Leu pro Thr Tyr Asn Asn His Leu 

245 250 255 

Tyr Lys Gin lie ser ser Ala ser Thr Gly Ala Ser Asn Asp Asn His 

260 265 270 

Tyr Phe Gly Tyr ser Thr Pro Trp Gly Tyr Phe Asp Phe Asn Arg Phe 

275 280 285 

His cys His Phe Ser Pro Arg Asp Trp Gin Arg Leu lie Asn Asn Asn 

290 295 300 

Trp Gly Phe Arg Pro Lys Arg Leu Asn phe Lys Leu Phe Asn lie Gin 
305 310 315 320 

val Lys Glu val Thr Thr Ser Asp Gly val Thr Thr He Ala Asn Asn 

325 330 335 

Leu Thr Ser Thr Val Gin val Phe ser Asp Ser Glu Tyr Gin Leu Pro 

340 345 350 

Tyr val Leu Gly Ser Ala His Gin Gly cys Leu Pro Pro Phe Pro Ala 

355 360 365 

Asp val Phe Met lie Pro Gin Tyr Gly Tyr Leu Thr Leu Asn Asn Gly 

370 375 380 

ser Gin Ala val Gly Arg Ser Ser Phe Tyr Cys Leu Glu Tyr Phe Pro 
385 390 395 400 

Ser Gin Met Leu Arg Thr Gly Asn Asn Phe Thr Phe Ser Tyr Thr Phe 

405 410 415 

Glu Glu val Pro Phe His Ser Ser Tyr Ala His Ser Gin ser Leu Asp 

420 425 430 

Arg Leu Met Asn Pro Leu lie Asp Gin Tyr Leu Tyr Tyr Leu Asn Arg 

435 440 445 

Thr Gin Asn Gin Ser Gly Ser Ala Gin Asn Lys Asp Leu Leu Phe ser 

450 455 460 

Arg Gly ser Pro Ala Gly Met Ser val Gin Pro Lys Asn Trp Leu Pro 
465 470 475 480 

Gly Pro Cys Tyr Arg Gin Gin Arg val Ser Lys Thr Lys Thr Gly Asn 

485 490 495 

Asn Asn Ser Asn Phe Thr Trp Thr Gly Ala ser Lys Tyr Asn Leu Asn 

500 505 510 

Gly His Glu ser lie lie Asn Pro Gly Thr Ala Met Ala ser His Lys 

515 520 525 

Asp Asp Glu Asp Lys Phe Phe Pro Met Ser Gly val Met lie Phe Gly 

530 535 540 

Lys Glu ser Ala Gly Ala Ser Asn Thr Ala Leu Asp Asn val Met lie 
545 550 555 560 

Thr Asp Glu Glu Glu lie Lys Ala Thr Asn Pro val Ala Thr Glu Arg 

565 570 575 

Phe Gly Thr val Ala Val Asn Phe Gin ser ser Ser Thr His Pro Ala 

580 585 590 

Thr Gly Asp val His Val Met Gly Ala Leu Pro Gly Met val Trp Gin 

595 600 605 

Asp Arg Asp Val Tyr Leu Gin Gly Pro lie Trp Ala Lys lie Pro His 

610 615 620 

Thr Asp Gly His Phe His Pro Ser Pro Leu Met Gly Gly Phe Gly Leu 
625 630 635 640 

Lys His Pro Pro Pro Gin lie Leu lie Lys Asn Thr Pro val Pro Ala 

645 650 655 

Asn Pro Pro Ala Glu Phe Ser Ala Thr Lys Phe Ala Ser Phe lie Thr 

660 665 670 

Gin Tyr ser Thr Gly Gin Val Ser val Glu lie Glu Trp Glu Leu Gin 

675 680 685 

Lys Glu Asn ser Lys Arg Trp Asn Pro Glu Val Gin Tyr Thr ser Asn 

690 695 700 

Tyr Ala Lys ser Ala Asn val Asp Phe Thr Val Asp Asn Asn Gly Leu 
705 710 715 720 

Tyr Thr Glu Pro Arg pro He Gly Thr Arg Tyr Leu Thr Arg Pro Leu 
725 730 735 
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<210> 23 
<211> 733 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Note = 
synthethic construct 

<400> 23 

Met Thr Asp Gly Tyr Leu Pro Asp Trp Leu Glu Asp Asn Leu Ser Glu 

15 10 15 

Gly Val Arg Glu Trp Trp Ala Leu Gin pro Gly Ala Pro Lys Pro Lys 

20 25 30 

Ala Asn Gin Gin His Gin Asp Asn Ala Arg Gly Leu val Leu Pro Gly 

35 40 45 

Tyr Lys Tyr Leu Gly Pro Gly Asn Gly Leu Asp Lys Gly Glu Pro Val 

50 55 60 

Asn Ala Ala Asp Ala Ala Ala Leu Glu His Asp Lys Ala Tyr Asp Gin 
65 70 75 n 80 

Gin Leu Lys Ala Gly Asp Asn Pro Tyr Leu Lys Tyr Asn His Ala Asp 

85 90 95 

Ala Glu Phe Gin Glu Arg Leu Gin Glu Asp Thr Ser Phe Gly Gly Asn 

100 105 110 

Leu Gly Arg Ala val Phe Gin Ala Lys Lys Arg lie Leu Glu Pro Leu 

115 120 125 

Gly Leu val Glu Glu Ala Ala Lys Thr Ala Pro Gly Lys Lys Arg Pro 

130 135 140 

Val Glu Gin Ser Pro Ala Glu Pro Asp ser Ser Ser Gly lie Gly Lys 
145 150 155 160 

Ser Gly Gin Gin Pro Ala Arg Lys Arg Leu Asn Phe Gly Gin Thr Gly 

165 170 175 

Asp Thr Glu Ser val Pro Asp Pro Gin Pro Leu Gly Gin pro Pro Ala 

180 185 190 

Ala Pro ser Gly Val Gly ser Thr Thr Met Ala Ser Gly Gly Gly Ala 

195 200 205 

Pro Met Ala Asp Asn Asn Glu Gly Ala Asp Gly val Gly Asn Ser Ser 

210 215 220 

Gly Asn Trp His Cys Asp ser Gin Trp Leu Gly Asp Arg Val lie Thr 
225 230 235 240 

Thr Ser Thr Arg Thr Trp Ala Leu Pro Thr Tyr Asn Asn His Leu Tyr 

245 250 255 

Lys Gin lie ser Ser Gin Ser Gly Ala Thr Asn Asp Asn His Tyr Phe 

260 265 270 

Gly Tyr Ser Thr Pro Trp Gly Tyr Phe Asp Phe Asn Arg Phe His cys 

275 280 285 

His Phe ser Pro Arg Asp Trp Gin Arg Leu lie Asn Asn Asn Trp Gly 

290 295 300 

Phe Arg pro Lys Arg Leu Asn Phe Lys Leu Phe Asn lie Gin val Lys 
305 310 315 320 

Glu val Thr Gin Asn Asp Gly Thr Thr Thr lie Ala Asn Asn Leu Thr 

325 330 335 

Ser Thr val Gin val Phe Thr Asp Ser Glu Tyr Gin Leu Pro Tyr val 

340 345 350 

Leu Gly ser Ala His Gin Gly cys Leu Pro Pro Phe Pro Ala Asp val 

355 360 365 

Phe Met val Pro Gin Tyr Gly Tyr Leu Thr Leu Asn Asn Gly ser Gin 

370 375 380 

Ala val Gly Arg Ser Ser Phe Tyr Cys Leu Glu Tyr Phe pro ser Gin 
385 390 395 400 

Met Leu Arg Thr Gly Asn Asn Phe Gin Phe Ser Tyr Thr Phe Glu Asp 




410 



415 
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val Pro Phe His Ser Ser Tyr Ala His Ser Gin ser Leu Asp Arg Leu 

420 425 430 

Met Asn Pro Leu lie Asp Gin Tyr Leu Tyr Tyr Leu Asn Arg Thr Gin 

435 440 445 

Thr Ala Ser Gly Thr Gin Gin Ser Arg Leu Leu Phe Ser Gin Ala Gly 

450 455 460 

pro Thr Ser Met ser Leu Gin Ala Lys Asn Trp Leu Pro Gly Pro Cys 
465 470 475 480 

Tvr Arq Gin Gin Arg Leu ser Lys Gin Ala Asn Asp Asn Asn Asn Ser 

485 490 495 

Asn Phe Pro Trp Thr Gly Ala Thr Lys Tyr His Leu Asn Gly Arg Asp 

500 505 510 

Ser Leu val Asn Pro Gly Pro Ala Met Ala Ser His Lys Asp Asp Lys 

515 520 525 

Glu Lys Phe Phe Pro Met His Gly Thr Leu He Phe Gly Lys Glu Gly 

530 535 540 

Thr Asn Ala Asn Asn Ala Asp Leu Glu Asn val Met lie Thr Asp Glu 
545 550 555 560 

Glu Glu He Arg Thr Thr Asn Pro Val Ala Thr Glu Gin Tyr Gly Thr 

565 570 575 

Val Ser Asn Asn Leu Gin Asn Ser Asn Ala Gly Pro Thr Thr Gly Thr 

580 585 590 

val Asn His Gin Gly Ala Leu Pro Gly Met val Trp Gin Asp Arg Asp 

595 600 605 

val Tyr Leu Gin Gly Pro lie Trp Ala Lys lie Pro His Thr Asp Gly 

610 615 620 

His Phe His Pro Ser Pro Leu Met Gly Gly Phe Gly Leu Lys His Pro 
625 630 635 640 

pro Pro Gin lie Met lie Lys Asn Thr Pro val Pro Ala Asn Pro Pro 

645 650 n 655 

Thr Asn Phe Ser Ala Ala Lys Phe Ala Ser Phe He Thr Gin Tyr Ser 

660 665 , 670 

Thr Gly Gin val Ser Val Glu He Glu Trp Glu Leu Gin Lys Glu Asn 

675 680 685 

Ser Lvs Arq Trp Asn Pro Glu lie Gin Tyr Thr ser Asn Tyr Asn Lys 

690 695 700 

Ser Val Asn Val Asp Phe Thr val Asp Thr Asn Gly val Tyr Ser Glu 
705 710 715 720 

Pro Arq Pro lie Gly Thr Arg Tyr Leu Thr Arg Asn Leu 
725 730 

<210> 24 
<211> 736 
<212> PRT 

<213> Artificial Sequence 

<220> . ... . 

<223> Description of Artificial Sequence: Note = 
synthethic construct 

Met°Ala 4 Ala Asp Gly Tyr Leu Pro Asp Trp Leu Glu Asp Asn Leu Ser 

15 10 !■> 

Glu Gly He Arg Glu Trp Trp Asp Leu Lys Pro Gly Ala pro Lys Pro 

20 25 30 

Lys Ala Asn Gin Gin Lys Gin Asp Asp Gly Arg Gly Leu val Leu Pro 

35 40 45 

Gly Tyr Lys Tyr Leu Gly Pro Phe Asn Gly Leu Asp Lys Gly Glu Pro 

50 55 60 

val Asn Ala Ala Asp Ala Ala Ala Leu Glu His Asp Lys Ala Tyr Asp 
65 70 75 ° u 

Gin Gin Leu Lys Ala Gly Asp Asn Pro Tyr Leu Arg Tyr Asn His Ala 
85 90 
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Asp Ala Glu Phe Gin Glu Arg Leu Gin Glu Asp Thr Ser Phe Gly Gly 

100 105 n HO 

Asn Leu Gly Arg Ala val Phe Gin Ala Lys Lys Arg val Leu Glu Pro 

115 120 125 

Phe Gly Leu val Glu Glu Gly Ala Lys Thr Ala Pro Gly Lys Lys Arg 

130 135 140 

Pro Val Glu Gin ser Pro Gin Glu Pro Asp ser Ser ser Gly lie Gly 
145 150 155 160 

Lys Thr Gly Gin Gin Pro Ala Lys Lys Arg Leu Asn Phe Gly Gin Thr 

Gly Asp ser Glu ser val Pro Asp Pro Gin Pro Leu Gly Glu Pro Pro 

180 185 1?0 

Ala Thr Pro Ala Ala Val Gly Pro Thr Thr Met Ala ser Gly Gly Gly 

195 200 205 

Ala Pro Met Ala Asp Asn Asn Glu Gly Ala Asp Gly Val Gly Asn Ala 

210 215 220 

Ser Gly Asn Trp His cys Asp Ser Thr Trp Leu Gly Asp Arg val lie 
225 230 235 . 240 

Thr Thr ser Thr Arg Thr Trp Ala Leu Pro Thr Tyr Asn Asn His Leu 

245 250 255 m 

Tyr Lys Gin lie Ser Ser Ala Ser Thr Gly Ala ser Asn Asp Asn His 

y 260 265 270 

Tyr Phe Gly Tyr Ser Thr Pro Trp Gly Tyr Phe Asp Phe Asn Arg Phe 

275 280 285 

His Cys His Phe Ser Pro Arg Asp Trp Gin Arg Leu lie Asn Asn Asn 

290 295 300 

Trp Gly Phe Arg Pro Lys Arg Leu Asn Phe Lys Leu Phe Asn lie Gin 
305 310 315 320 

Val Lys Glu val Thr Thr Asn Asp Gly Val Thr Thr lie Ala Asn Asn 

325 330 335 

Leu Thr ser Thr Val Gin val Phe Ser Asp Ser Glu Tyr Gin Leu Pro 

340 345 350 

Tyr Val Leu Gly Ser Ala His Gin Gly Cys Leu Pro Pro Phe Pro Ala 

355 360 365 

Asp val Phe Met lie Pro Gin Tyr Gly Tyr Leu Thr Leu Asn Asn Gly 

370 375 380 

Ser Gin Ala val Gly Arg ser Ser Phe Tyr Cys Leu Glu Tyr Phe Pro 
385 390 395 400 

Ser Gin Met Leu Arg Thr Gly Asn Asn Phe Thr Phe ser Tyr Thr Phe 

405 410 415 

Glu Glu val Pro Phe His Ser ser Tyr Ala His Ser Gin Ser Leu Asp 

420 425 430 

Arg Leu Met Asn Pro Leu lie Asp Gin Tyr Leu Tyr Tyr Leu Asn Arg 

435 440 445 

Thr Gin Asn Gin Ser Gly ser Ala Gin Asn Lys Asp Leu Leu Phe ser 

450 455 460 

Arg Gly ser Pro Ala Gly Met ser Val Gin Pro Lys Asn Trp Leu Pro 
465 470 475 480 

Gly Pro Cys Tyr Arg Gin Gin Arg val Ser Lys Thr Lys Thr Asp Asn 

485 490 495 

Asn Asn ser Asn Phe Thr Trp Thr Gly Ala Ser Lys Tyr Asn Leu Asn 

500 505 510 m 

Gly Arg Glu ser He lie Asn Pro Gly Thr Ala Met Ala Ser His Lys 

515 520 525 

Asp Asp Glu Asp Lys Phe Phe Pro Met Ser Gly val Met He Phe Gly 

530 535 540 

Lys Glu Ser Ala Gly Ala Ser Asn Thr Ala Leu Asp Asn Val Met lie 
545 550 555 n 560 

Thr Asp Glu Glu Glu lie Lys Ala Thr Asn Pro val Ala Thr Glu Arg 

565 570 575 

Phe Gly Thr val Ala Val Asn Phe Gin Ser Ser Ser Thr Asp Pro Ala 

580 585 590 

Thr Gly Asp val His val Met Gly Ala Leu pro Gly Met val Trp Gin 
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595 600 605 

Asp Arg Asp Val Tyr Leu Gin Gly Pro lie Trp Ala Lys lie Pro His 

610 * 615 620 

Thr Asp Gly His Phe His Pro Ser Pro Leu Met Gly Gly Phe Gly Leu 
625 630 635 640 

Lys His Pro Pro Pro Gin He Leu lie Lys Asn Thr Pro Val pro Ala 

645 650 655 

Asn Pro Pro Ala Glu Phe Ser Ala Thr Lys Phe Ala ser Phe He Thr 

660 665 670 

Gin Tyr ser Thr Gly Gin val ser val Glu lie Glu Trp Glu Leu Gin 

675 680 685 

Lys Glu Asn Ser Lys Arg Trp Asn Pro Glu Val Gin Tyr Thr Ser Asn 

690 695 700 

Tyr Ala Lys Ser Ala Asn Val Asp Phe Thr Val Asp Asn Asn Gly Leu 
705 710 715 720 

Tyr Thr Glu Pro Arg Pro lie Gly Thr Arg Tyr Leu Thr Arg Pro Leu 
725 730 735 

<210> 25 
<211> 736 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Note = 
synthethic construct 

<400> 25 

Met Ala Ala Asp Gly Tyr Leu Pro Asp Trp Leu Glu Asp Asn Leu Ser 

15 10 15 

Glu Gly lie Arg Lys Trp Trp Asp Leu Lys Pro Gly Ala Pro Lys Pro 

20 25 30 

Lys Ala Asn Gin Gin Lys Gin Asp Asp Gly Arg Gly Leu val Leu Pro 

35 40 45 

Gly Tyr Lys Tyr Leu Gly Pro Phe Asn Gly Leu Asp Lys Gly Glu Pro 

50 55 60 

val Asn Ala Ala Asp Ala Ala Ala Leu Glu His Asp Lys Ala Tyr Asp 
65 70 75 80 

Gin Gin Leu Lys Ala Gly Asp Asn Pro Tyr Leu Arg Tyr Asn His Ala 

85 90 95 

Asp Ala Glu Phe Gin Glu Arg Leu Gin Glu Asp Thr Ser Phe Gly Gly 

100 105 110 

Asn Leu Gly Arg Ala val Phe Gin Ala Lys Lys Arg val Leu Glu Pro 

115 ~ 120 125 

Phe Gly Leu Val Glu Glu Gly Ala Lys Thr Ala Pro Gly Lys Lys Arg 

130 135 140 

Pro Val Glu Gin Ser Pro Gin Glu Pro Asp Ser Ser Ser Gly lie Gly 
145 150 155 160 

Lys Thr Gly Gin Gin Pro Ala Lys Lys Arg Leu Asn Phe Gly Gin Thr 

165 170 175 

Gly Asp Ser Glu ser Val Pro Asp Pro Gin Pro Leu Gly Glu Pro Pro 

180 185 190 

Ala Thr pro Ala Ala val Gly pro Thr Thr Met Ala Ser Gly Gly Gly 

195 200 205 

Ala Pro Met Ala Asp Asn Asn Glu Gly Ala Asp Gly val Gly Asn Ala 

210 215 220 

ser Gly Asn Trp His cys Asp ser Thr Trp Leu Gly Asp Arg val lie 
225 230 235 240 

Thr Thr ser Thr Arg Thr Trp Ala Leu Pro Thr Tyr Asn Asn His Leu 

245 250 255 

Tyr Lys Gin lie ser ser Ala ser Thr Gly Ala Ser Asn Asp Asn His 

260 265 270 

Tyr Phe Gly Tyr ser Thr Pro Trp Gly Tyr Phe Asp Phe Asn Arg Phe 
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275 280 285 

His cys His Phe Ser Pro Arg Asp Trp Gin Arg Leu lie Asn Asn Asn 

290 295 300 

Trp Gly Phe Arg pro Lys Arg Leu Asn Phe Lys Leu Phe Asn lie Gin 
305 310 315 320 

val Lys Glu val Thr Thr Asn Asp Gly Val Thr Thr lie Ala Asn Asn 

325 330 335 

Leu Thr Ser Thr Val Gin Val Phe ser Asp Ser Glu Tyr Gin Leu Pro 

340 345 350 

Tyr Val Leu Gly ser Ala His Gin Gly cys Leu Pro pro Phe Pro Ala 

355 360 365 

Asp val Phe Met lie Pro Gin Tyr Gly Tyr Leu Thr Leu Asn Asn Gly 

370 375 380 

Ser Gin Ala Val Gly Arg Ser ser Phe Tyr Cys Leu Glu Tyr Phe Pro 
385 390 395 400 

Ser Gin Met Leu Arg Thr Gly Asn Asn Phe Thr phe ser Tyr Thr Phe 

, n ^ 405 410 415 

Glu Glu val Pro Phe His Ser Ser Tyr Ala His ser Gin Ser Leu Asp 

420 425 430 

Arg Leu Met Asn Pro Leu lie Asp Gin Tyr Leu Tyr Tyr Leu Asn Arg 

435 440 445 

Thr Gin Asn Gin ser Gly ser Ala Gin Asn Lys Asp Leu Leu Phe ser 

450 455 460 

Arg Gly Ser Pro Ala Gly Met Ser val Gin Pro Lys Asn Trp Leu Pro 
465 470 475 480 

Gly Pro cys Tyr Arg Gin Gin Arg val Ser Lys Thr Lys Thr Asp Asn 

485 490 495 

Asn Asn Ser Asn phe Thr Trp Thr Gly Ala Ser Lys Tyr Asn Leu Asn 

500 505 510 

Gly Arg Glu Ser He He Asn pro Gly Thr Ala Met Ala Ser His Lys 

515 520 525 

Asp Asp Lys Asp Lys Phe Phe Pro Met Ser Gly Val Met lie Phe Gly 

530 535 540 

Lys Glu ser Ala Gly Ala. ser Asn Thr Ala Leu Asp Asn Val Met lie 
545 550 555 560 

Thr Asp Glu Glu Glu lie Lys Ala Thr Asn pro val Ala Thr Glu Arg 

565 570 575 

Phe Gly Thr val Ala val Asn Phe Gin Ser Ser ser Thr Asp Pro Ala 

580 585 590 

Thr Gly Asp val His val Met Gly Ala Leu Pro Gly Met val Trp Gin 

595 600 605 

Asp Arg Asp val Tyr Leu Gin Gly Pro lie Trp Ala Lys He Pro His 

610 615 620 

Thr Asp Gly His Phe His Pro ser Pro Leu Met Gly Gly Phe Gly Leu 
625 630 635 640 

Lys His Pro Pro Pro Gin He Leu lie Lys Asn Thr Pro val Pro Ala 

645 650 655 

Asn Pro Pro Ala Glu Phe Ser Ala Thr Lys Phe Ala ser Phe lie Thr 

660 665 670 

Gin Tyr ser Thr Gly Gin val ser Val Glu lie Glu Trp Glu Leu Gin 

675 680 685 

Lys Glu Asn ser Lys Arg Trp Asn pro Glu val Gin Tyr Thr Ser Asn 

690 695 700 

Tyr Ala Lys ser Ala Asn val Asp Phe Thr Val Asp Asn Asn Gly Leu 
70S 710 715 720 

Tyr Thr Glu Pro Arg Pro He Gly Thr Arg Tyr Leu Thr Arg Pro Leu 
725 730 735 

<210> 26 
<211> 736 
<212> PRT 

<213> Artificial Sequence 
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<220> 

<223> Description of Artificial Sequence: Note = 
synthethic construct 

<400> 26 

Met Ala Ala Asp Gly Tyr Leu Pro Asp Trp Leu Glu Asp Asn Leu Ser 

} 5 10 15 

Glu Gly lie Arg Glu Trp Trp Asp Leu Lys Pro Gly Ala Pro Lys Pro 

20 25 30 

Lys Ala Asn Gin Gin Lys Gin Asp Asn Gly Arg Gly Leu Val Leu Pro 

35 40 45 

Gly Tyr Lys Tyr Leu Gly Pro Phe Asn Gly Leu Asp Lys Gly Glu Pro 

50 55 60 

val Asn Ala Ala Asp Ala Ala Ala Leu Glu His Asp Lys Ala Tyr Asp 
6 * 70 75 80 

Gin Gin Leu Lys Ala Gly Asp Asn pro Tyr Leu Arg Tyr Asn His Ala 

85 90 95 

Asp Ala Glu Phe Gin Glu Arg Leu Gin Glu Asp Thr ser Phe Gly Gly 

100 105 110 

Asn Leu Gly Arg Ala Val Phe Gin Ala Lys Lys Arg Val Leu Glu Pro 

u 115 , . 120 12 5 

Phe Gly Leu Val Glu Glu Gly Ala Lys Thr Ala Pro Gly Lys Lys Arq 

130 135 140 

Pro Val Glu Gin ser Pro Gin Glu Pro Asp Ser Ser Ser Gly He Gly 
145 150 155 160 

Lys Thr Gly Gin Gin Pro Ala Lys Lys Arg Leu Asn Phe Gly Gin Thr 

, 165 170 175 

Gly Asp ser Glu Ser Val Pro Asp Pro Gin Pro Leu Gly Glu Pro Pro 

- , 1?0 185 190 

Ala Thr Pro Ala Ala Val Gly Pro Thr Thr Met Ala Ser Gly Gly Gly 

195 200 205 

Ala Pro Met Ala Asp Asn Asn Glu Gly Ala Asp Gly Val Gly Asn Ala 

2 10 215 220 

ser Gly Asn Trp His cys Asp ser Thr Trp Leu Gly Asp Arg Val He 
225 , , 2 30 235 240 

Thr Thr Ser Thr Arg Thr Trp Ala Leu Pro Thr Tyr Asn Asn His Leu 

, , 2 *5 250 * 255 

Tyr Lys Gin He Ser Ser Ala Ser Thr Gly Ala Ser Asn Asp Asn His 

, 260 265 270 

Tyr Phe Gly Tyr Ser Thr Pro Trp Gly Tyr Phe Asp Phe Asn Arg Phe 

275 280 285 

His Cys His Phe Ser Pro Arg Asp Trp Gin Arg Leu lie Asn Asn Asn 

290 295 300 

Trp Gly Phe Arg pro Lys Arg Leu Asn Phe Lys Leu Phe Asn lie Gin 
305 310 315 320 

val Lys Glu val Thr Thr Asn Asp Gly Val Thr Thr lie Ala Asn Asn 

325 330 335 

Leu Thr Ser Thr Val Gin Val Phe Ser Asp Ser Glu Tyr Gin Leu Pro 

340 345 350 

Tyr val Leu Gly ser Ala His Gin Gly Cys Leu pro Pro Phe Pro Ala 

355 360 365 

Asp val Phe Met lie Pro Gin Tyr Gly Tyr Leu Thr Leu Asn Asn Gly 

370 375 380 

ser Gin Ala Val Gly Arg Ser ser Phe Tyr cys Leu Glu Tyr Phe pro 
385 390 395 400 

Ser Gin Met Leu Arg Thr Gly Asn Asn Phe Thr Phe Ser Tyr Thr Phe 

, , , 405 410 415 

Glu Glu val Pro phe His Ser ser Tyr Ala His Ser Gin Ser Leu Asp 

420 425 430 

Arg Leu Met Asn Pro Leu lie Asp Gin Tyr Leu Tyr Tyr Leu Asn Arg 

435 440 445 

Thr Gin Asn Gin ser Gly ser Ala Gin Asn Lys Asp Leu Leu Phe ser 
450 455 460 
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Arq Gly ser Pro Ala Gly Met Ser Val Gin Pro Lys Asn Trp Leu Pro 
465 470 475 480 

Gly Pro cys Tyr Arg Gin Gin Arg val ser Lys Thr Lys Thr Asp Asn 

485 490 495 

Asn Asn Ser Asn Phe Thr Trp Thr Gly Ala Ser Lys Tyr Asn Leu Asn 

500 505 510 

Gly Arq Glu Ser lie He Asn Pro Gly Thr Ala Met Ala Ser His Lys 

515 520 525 

Asp Asp Glu Asp Lys Phe Phe Pro Met Ser Gly Val Met He Phe Gly 

530 535 n 540. 

Lys Glu Ser Ala Gly Ala Ser Asn Thr Ala Leu Asp Asn val Met lie 
545 550 555 560 

Thr Asp Glu Glu Glu lie Lys Ala Thr Asn Pro Val Ala Thr Glu Arg 

565 570 575 

Phe Gly Thr val Ala val Asn Phe Gin ser Ser Ser Thr His Pro Ala 

580 585 590 

Thr Gly Asp val His val Met Gly Ala Leu Pro Gly Met val Trp Gin 

595 600 605 

Asp Arg Asp val Tyr Leu Gin Gly Pro lie Trp Ala Lys He Pro His 

610 615 620 

Thr Asp Gly His Phe His Pro Ser Pro Leu Met Gly Gly Phe Gly Leu 
625 630 635 ^ 640 

Lys His Pro Pro Pro Gin He Leu lie Lys Asn Thr Pro Val Pro Ala 

645 650 655 

Asn Pro Pro Ala Glu Phe ser Ala Thr Lys Phe Ala Ser Phe lie Thr 

660 665 670 

Gin Tyr Ser Thr Gly Gin val Ser val Glu He Glu Trp Glu Leu Gin 

675 680 685 

Lys Glu Asn Ser Lys Arg Trp Asn Pro Glu val Gin Tyr Thr Ser Asn 

690 695 700 

Tyr Ala Lys Ser Ala Asn val Asp Phe Thr val Asp Asn Asn Gly Leu 
705 710 715 720 

Tyr Thr Glu pro Arg Pro He Gly Thr Arg Tyr Leu Thr Arg Pro Leu 
725 730 735 

<210> 27 
<211> 736 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Note = 
synthethic construct 

<400> 27 

Met Ala Ala Asp Gly Tyr Leu Pro Asp Trp Leu Glu Asp Asn Leu ser 

15 10 15 

Glu Gly lie Arg Glu Trp Trp Asp Leu Lys Pro Gly Ala Pro Lys pro 

20 25 30 

Lys Ala Asn Gin Gin Lys Gin Asp Asp Gly Arg Gly Leu val Leu Pro 

35 40 45 

Gly Tyr Lys Tyr Leu Gly Pro Phe Asn Gly Leu Asp Lys Gly Glu Pro 

50 55 60 

Val Asn Ala Ala Asp Ala Ala Ala Leu Glu His Asp Lys Ala Tyr Asp 
65 70 75 . 80 

Gin Gin Leu Lys Ala Gly Asp Asn Pro Tyr Leu Arg Tyr Asn His Ala 

85 90 95 

Asp Ala Glu Phe Gin Glu Arg Leu Gin Glu Asp Thr ser Phe Gly Gly 

100 105 HO 

Asn Leu Gly Arg Ala val Phe Gin Ala Lys Lys Arg Val Leu Glu pro 

115 120 125 

Phe Gly Leu Val Glu Glu Gly Ala Lys Thr Ala Pro Gly Lys Lys Arg 
130 135 140 
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Pro val Glu Gin ser Pro Gin Glu pro Asp Ser Ser ser Gly lie Gly 
145 150 155 160 

Lys Thr Gly Gin Gin Pro Ala Lys Lys Arg Leu Asn Phe Gly Gin Thr 

_ 165 170 175 

Gly Asp ser Glu ser Val Pro Asp Pro Gin Pro Leu Gly Glu Pro Pro 

, , 180 185 190 

Ala Thr Pro Ala Ala Val Gly Pro Thr Thr Met Ala ser Gly Gly Gly 

195 200 205 

Ala pro Met Ala Asp Asn Asn Glu Gly Ala Asp Gly val Gly Asn Ala 

210 215 220 

ser Gly Asn Trp His cys Asp Ser Thr Trp Leu Gly Asp Arg val He 
225 230 235 " 240 

Thr Thr Ser Thr Arg Thr Trp Ala Leu pro Thr Tyr Asn Asn His Leu 

, , 245 250 255 

Tyr Lys Gin lie ser Ser Ala Ser Thr Gly Ala Ser Asn Asp Asn His 

260 265 270 

Tyr phe Gly Tyr Ser Thr Pro Trp Gly Tyr Phe Asp Phe Asn Arg Phe 

275 280 285 

His cys His Phe ser Pro Arg Asp Trp Gin Arg Leu lie Asn Asn Asn 

290 295 300 

Trp Gly Phe Arg pro Lys Arg Leu Asn Phe Lys Leu Phe Asn He Gin 
305 310 315 320 

Val Lys Glu val Thr Thr Asn Asp Gly Val Thr Thr lie Ala Asn Asn 

, 325 330 335 

Leu Thr Ser Thr val Gin Val Phe Ser Asp Ser Glu Tyr Gin Leu Pro 

340 345 350 

Tyr val Leu Gly ser Ala His Gin Gly Cys Leu Pro pro Phe Pro Ala 

355 360 365 

Asp Val Phe Met lie Pro Gin Tyr Gly Tyr Leu Thr Leu Asn Asn Gly 

370 375 380 

Ser Gin Ala val Gly Arg Ser Ser Phe Tyr Cys Leu Glu Tyr Phe Pro 
385 390 395 400 

Ser Gin Met Leu Arg Thr Gly Asn Asn Phe Thr Phe Ser Tyr Thr Phe 

405 410 415 

Glu Asp Val Pro Phe His Ser Ser Tyr Ala His ser Gin Ser Leu Asp 

420 425 430 

Arg Leu Met Asn Pro Leu He Asp Gin Tyr Leu Tyr Tyr Leu Asn Arg 

435 440 445 

Thr Gin Asn Gin ser Gly Ser Ala Gin Asn Lys Asp Leu Leu Phe ser 

450 455 460 

Arg Gly Ser Pro Ala Gly Met ser val Gin Leu Lys Asn Trp Leu Pro 
465 470 475 480 

Gly Pro Cys Tyr Arg Gin Gin Arg val Ser Lys Thr Lys Thr Asp Asn 

485 ~ 490 495 

Asn Asn Ser Asn Phe Thr Trp Thr Gly Ala Ser Lys Tyr Asn Leu Asn 

n 500 505 510 

Gly Arg Glu Ser lie lie Asn Pro Gly Thr Ala Met Ala Ser His Lys 

515 520 525 

Asp Asp Glu Asp Lys Phe Phe pro Met Ser Gly val Met lie Phe Gly 

530 535 540 

Lys Glu Ser Ala Gly Ala Ser Asn Thr Ala Leu Asp Asn val Met lie 
545 550 555 560 

Thr Asp Glu Glu Glu lie Lys Ala Thr Asn Pro val Ala Thr Glu Arg 

565 570 575 

Phe Gly Thr val Ala val Asn Phe Gin Ser Ser ser Thr Asp Pro Ala 

, , 580 585 590 

Thr Gly Asp Val His val Met Gly Ala Leu Pro Gly Met val Trp Gin 

595 600 605 

Asp Arg Asp val Tyr Leu Gin Gly Pro He Trp Ala Lys lie Pro His 

610 615 620 

Thr Asp Gly His Phe His Pro ser pro Leu Met Gly Gly Phe Gly Leu 
625 630 635 640 

Lys His Pro Pro Pro Gin He Leu He Lys Asn Thr Pro val Pro Ala 
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645 650 655 

Asn Pro Pro Ala Glu Phe Ser Ala Thr Lys Phe Ala ser Phe He Thr 

660 665 670 

Gin Tyr ser Thr Gly Gin Val Ser val Glu lie Glu Trp Glu Leu Gin 

675 680 685 

Lys Glu Asn ser Lys Arg Trp Asn Pro Glu val Gin Tyr Thr Ser Asn 

690 695 700 

Tyr Ala Lys Ser Ala Asn val Asp Phe Thr val Asp Asn Asn Gly Leu 
705 710 715 720 

Tyr Thr Glu Pro Arg Pro lie Gly Thr Arg Tyr Leu Thr Arg Pro Leu 
725 730 735 

<210> 28 
<211> 736 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Note = 
synthethic construct 

<400> 28 , 

Met Ala Ala Asp Gly Tyr Leu Pro Asp Trp Leu Glu Asp Asn Leu Ser 

15 10 15 

Glu Gly lie Arg Glu Trp Trp Asp Leu Lys Pro Gly Ala Pro Lys Pro 

20 25 30 

Lys Ala Asn Gin Gin Lys Gin Asp Asp Gly Arg Gly Leu Val Leu Pro 

35 40 45 

Gly Tyr Lys Tyr Leu Gly Pro Phe Asn Gly Leu Asp Lys Gly Glu Pro 

50 55 60 

val Asn Ala Ala Asp Ala Ala Ala Leu Glu His Asp Lys Ala Tyr Asp 
65 70 75 . 80 

Gin Gin Leu Lys Ala Gly Asp Asn Pro Tyr Leu Arg Tyr Asn His Ala 

85 90 95 

Asp Ala Glu Phe Gin Glu Arg Leu Gin Glu Asp Thr Ser Phe Gly Gly 

100 105 110 

Asn Leu Gly Arg Ala val Phe Gin Ala Lys Lys Arg Val Leu Glu Pro 

115 120 125 

Phe Gly Leu Val Glu Glu Gly Ala Lys Thr Ala Pro Gly Lys Lys Arg 

130 135 140 

Pro val Glu Gin Ser Pro Gin Glu Pro Asp Ser Ser Ser Gly lie Gly 
145 150 155 160 

Lys Ser Gly Gin Gin Pro Ala Lys Lys Arg Leu Asn Phe Gly Gin Thr 

165 170 175 

Gly Asp Ser Glu Ser Val Pro Asp Pro Gin Pro Leu Gly Glu Pro Pro 

180 185 190 

Ala Thr Pro Ala Ala Val Gly Pro Thr Thr Met Ala Ser Gly Gly Gly 

195 200 205 

Ala pro Met Ala Asp Asn Asn Glu Gly Ala Asp Gly Val Gly Asn Ala 

210 215 220 

ser Gly Asn Trp His Cys Asp ser Thr Trp Leu Gly Asp Arg val lie 
225 230 235 . 240 

Thr Thr ser Thr Arg Thr Trp Ala Leu Pro Thr Tyr Asn Asn His Leu 

245 250 255 m 

Tyr Lys Gin lie ser ser Ala Ser Thr Gly Ala Ser Asn Asp Asn His 

y y 260 265 270 

Tyr Phe Gly Tyr ser Thr Pro Trp Gly Tyr Phe Asp Phe Asn Arg Phe 

275 280 285 

His Cys His Phe Ser Pro Arg Asp Trp Gin Arg Leu lie Asn Asn Asn 

290 295 300 

Trp Gly Phe Arg Pro Lys Arg Leu Asn Phe Lys Leu Phe Asn He Gin 
305 310 315 320 

val Lys Glu val Thr Thr Asn Asp Gly Val Thr Thr lie Ala Asn Asn 

Page 32 



WO 2006/119432 PCT/US2006/017157 



140140433Pl.TXT 
325 330 335 

Leu Thr Ser Thr val Gin Val Phe ser Asp Ser Glu Tyr Gin Leu Pro 

340 345 350 

Tyr val Leu Gly ser Ala His Gin Gly cys Leu Pro Pro Phe Pro Ala 

355 360 365 

Asp val Phe Met lie Pro Gin Tyr Gly Tyr Leu Thr Leu Asn Asn Gly 

370 375 380 

Ser Gin Ala val Gly Arg ser ser Phe Tyr Cys Leu Glu Tyr Phe Pro 
385 390 395 400 

Ser Gin Met Leu Arg Thr Gly Asn Asn Phe Thr Phe ser Tyr Thr Phe 

405 b 410 415 

Glu Asp Val Pro Phe His Ser Ser Tyr Ala His Ser Gin Ser Leu Asp 

420 425 430 

Arg Leu Met Asn Pro Leu lie Asp Gin Tyr Leu Tyr Tyr Leu Asn Arg 

435 440 445 

Thr Gin Asn Gin Ser Gly Ser Ala Gin Asn Lys Asp Leu Leu Phe Ser 

450 455 460 

Arg Gly Ser Pro Ala Gly Met Ser Val Gin Pro Lys Asn Trp Leu Pro 
465 470 475 480 

Gly Pro Cys Tyr Arg Gin Gin Arg val ser Lys Thr Lys Thr Asp Asn 

485 490 495 

Asn Asn Ser Asn Phe Thr Trp Thr Gly Ala Ser Lys Tyr Asn Leu Asn 

500 505 510 

Gly Arg Glu Ser lie He Asn Pro Gly Thr Ala Met Ala Ser His Lys 

515 520 525 

Asp Asp Glu Asp Lys Phe Phe Pro Met ser Gly val Met lie Phe Gly 

530 535 540 

Lys Gly ser Ala Gly Ala Ser Asn Thr Ala Leu Asp Asn Val Met lie 
545 550 555 560 

Thr Asp Glu Glu Glu lie Lys Ala Thr Asn Pro Val Ala Thr Glu Arg 

565 570 575 

Phe Gly Thr val Ala val Asn Phe Gin Ser ser Ser Thr Asp Pro Ala 

580 585 590 

Thr Gly Asp Val His Val Met Gly Ala Leu Pro Gly Met Val Trp Gin 

595 600 605 

Asp Arg Asp val Tyr Leu Gin Gly Pro lie Trp Ala Lys lie Pro His 

610 615 620 

Thr Asp Gly His Phe His Pro Ser Pro Leu Met Gly Gly Phe Gly Leu 
625 630 635 640 

Lys His Pro Pro Pro Gin lie Leu lie Lys Asn Thr pro Val Pro Ala 

645 650 655 

Asn Pro Pro Ala Glu Phe Ser Ala Thr Lys Phe Ala Ser Phe lie Thr 

660 665 670 

Gin Tyr Ser Thr Gly Gin Val Ser val Glu He Glu Trp Glu Leu Gin 

675 680 685 

Lys Glu Asn ser Lys Arg Trp Asn Pro Glu Val Gin Tyr Thr Ser Asn 

690 695 700 

Tyr Ala Lys Ser Ala Asn Val Asp Phe Thr Val Asp Asn Asn Gly Leu 
705 710 715 720 

Tyr Thr Glu Pro Arg Pro lie Gly Thr Arg Tyr Leu Thr Arg Pro Leu 
725 730 735 

<210> 29 
<211> 736 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Note = 
synthethic construct 

<400> 29 

Met Ala Ala Asp Gly Tyr Leu Pro Asp Trp Leu Glu Asp Asn Leu Ser 
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Glu Gly lie Arg Glu Trp Trp Asp Leu Lys Pro Gly Ala Pro Lys Pro 

20 25 30 

Lys Ala Asn Gin Gin Lys Gin Asp Asp Gly Arg Gly Leu Val Leu Pro 

35 40 " 45 

Gly Tyr Lys Tyr Leu Gly Pro Phe Asn Gly Leu Asp Lys Gly Glu Pro 

50 " 55 60 

val Asn Ala Ala Asp Ala Ala Ala Leu Glu His Asp Lys Ala Tyr Asp 
65 70 75 80 

Gin Gin Leu Lys Ala Gly Asp Asn Pro Tyr Leu Arg Tyr Asn His Ala 

85 90 95 

Asp Ala Glu Phe Gin Glu Arg Leu Gin Glu Asp Thr ser Phe Gly Gly 

100 ~ 105 110 

Asn Leu Gly Arg Ala Val Phe Gin Ala Lys Lys Arg val Leu Glu Pro 

115 120 ' * 125 

Phe Gly Leu Val Glu Glu Gly Ala Lys Thr Ala Pro Gly Lys Lys Arg 

130 135 140 

Pro val Glu Gin Ser Pro Gin Glu Pro Asp Ser Ser ser Gly lie Gly 
145 150 155 160 

Lys Ser Gly Gin Gin Pro Ala Lys Lys Arg Leu Asn Phe Gly Gin Thr 

165 170 175 

Gly Asp ser Glu Ser Val Pro Asp Pro Gin Pro Leu Gly Glu Pro Pro 

180 185 190 

Ala Thr Pro Ala Ala Leu Gly Pro Thr Thr Met Ala Ser Gly Gly Gly 

195 200 205 

Ala Pro Met Ala Asp Asn Asn Glu Gly Ala Asp Gly val Gly Asn Ala 

210 215 220 

ser Gly Asn Trp His Cys Asp Ser Thr Trp Leu Gly Asp Arg val He 
225 230 235 240 

Thr Thr ser Thr Arg Thr Trp Ala Leu Pro Thr Tyr Asn Asn His Leu 

245 250 255 

Tyr Lys Gin He Ser Ser Ala Ser Thr Gly Ala ser Asn Asp Asn His 

260 265 270 

Tyr Phe Gly Tyr Ser Thr Pro Trp Gly Tyr Phe Asp Phe Asn Arg Phe 

275 280 285 

His cys His Phe ser Pro Arg Asp Trp Gin Arg Leu lie Asn Asn Asn 

290 295 300 

Trp Gly Phe Arg Pro Lys Arg Leu Asn Phe Lys Leu Phe Asn lie Gin 
305 310 > 315 320 

Val Lys Glu val Thr Thr Asn Asp Gly Val Thr Thr lie Ala Asn Asn 

325 330 335 

Leu Thr Ser Thr val Gin val Phe Ser Asp Ser Glu Tyr Gin Leu Pro 

340 345 350 

Tyr val Leu Gly Ser Ala His Gin Gly Cys Leu Pro Pro Phe Pro Ala 

355 360 365 

Asp val Phe Met lie Pro Gin Tyr Gly Tyr Leu Thr Leu Asn Asn Gly 

370 375 380 

Ser Lys Ala val Gly Arg Ser Ser Phe Tyr Cys Leu Glu Tyr Phe Pro 
385 390 395 400 

Ser Gin Met Leu Arg Thr Gly Asn Asn Phe Thr Phe Ser Tyr Thr Phe 

405 410 415 

Glu Glu val Pro Phe His Ser ser Tyr Ala His Ser Gin Ser Leu Asp 

420 425 430 

Arg Leu Met Asn Pro Leu lie Asp Gin Tyr Leu Tyr Tyr Leu Asn Arg 

435 440 445 

Thr Gin Asn Gin Ser Gly Ser Ala Gin Asn Lys Asp Leu Leu Phe Ser 

450 455 460 

Arg Gly Ser Pro Ala Gly Met Ser Val Gin Pro Lys Asn Trp Leu Pro 
465 470 475 480 

Gly Pro Cys Tyr Arg Gin Gin Arg Val Ser Lys Thr Lys Thr Asp Asn 

485 490 495 

Asn Asn ser Asn Phe Thr Trp Thr Gly Ala Ser Lys Tyr Asn Leu Asn 



500 




510 
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> Asn Pro Gly Thr Ala Met Ala Ser His Lys 
520 525 

.._ ,.. r , ... ...» phe Pro Met ser Gly val Met lie Phe Gly 

535 535 , 540 

w „ Glu Ser Ala Gly Ala Ser Asn Thr Ala Leu Asp Asn Val Met lie 
545 550 555 560 

" » Lys Ala Thr Asn Pro val Ala Thr Glu Arg 
570 575 
I Asn Phe Gin ser ser Ser Thr Asp Pro Ala 

585 590 
1 Met Gly Ala Leu pro Gly Met val Trp Gin 
600 60S 



GIU 


Ser 


He 


515 






Glu 


ASD 


Lys 


Ser 


Ala 


Gly 


Glu 


Glu 


Glu 






565 


Thr 


val 


Ala 




580 




Asp 


Val 


His 


595 






Asp 


val 


Tvr 


Gly 


His 


Phe 


pro 


Pro 


Pro 






645 


Pro 


Ala 


Glu 




660 




Ser 


Thr 


Gly 


675 




Asn 


Ser 


Lys 


Lys 


Ser 


Ala 


Glu 


Pro 


Arg 






725 



625 J 630 635 * ^ 640 

"n He Leu He Lys Asn Thr Pro val Pro Ala 
650 655 
e Ser Ala Thr Lys Phe Ala Ser Phe lie Thr 

665 670 
n val Ser val Glu lie Glu Trp Glu Leu Gin 

680 685 
g Trp Asn Pro Glu. Val Gin Tyr Thr ser Asn 
690 695 700 

,v, Ala Lys Ser Ala Asn val Asp Phe Thr Val Asp Asn Asn Gly Leu 
705 710 715 720 

o lie Gly Thr Arg Tyr Leu Thr Arg Pro Leu 
730 735 

<210> 30 
<211> 742 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Note = 
synthethic construct 

<400> 30 

Met Ala Ala Asp Gly Tyr Leu Pro Asp Trp Leu Glu Asp Asn Leu Ser 

15 10 15 

Glu Gly lie Arg Glu Trp Trp Ala Leu Lys Pro Gly Ala Pro Gin Pro 

20 25 30 

Lys Ala Asn Gin Gin His Gin Asp Asn Gly Arg Gly Leu Val Leu Pro 

35 40 45 

Gly Tyr Lys Tyr Leu Gly Pro Phe Asn Gly Leu Asp Lys Gly Glu Pro 

50 55 60 

val Asn Glu Ala Asp Ala Ala Ala Leu Glu His Asp Lys Ala Tyr Asp 
65 70 75 80 

Lys Gin Leu Glu Gin Gly Asp Asn Pro Tyr Leu Lys Tyr Asn His Ala 

85 90 95 

Asp Ala Glu Phe Gin Gin Arg Leu Ala Thr Asp Thr ser Phe Gly Gly 

100 105 HO 

Asn Leu Gly Arg Ala val Phe Gin Ala Lys Lys Arg lie Leu Glu Pro 

115 120 125 

Leu Gly Leu val Glu Glu Gly val Lys Thr Ala Pro Gly Lys Lys Arg 

130 135 140 

Pro Leu Glu Lys Thr Pro Asn Arg Pro Thr Asn Pro Asp Ser Gly Lys 
145 150 155 160 

Ala Pro Ala Lys Lys Lys Gin Lys Asp Gly Glu Pro Ala Asp Ser Ala 

165 170 175 

Arg Arg Thr Leu Asp Phe Glu Asp ser Gly Ala Gly Asp Gly Pro Pro 
180 185 190 
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Glu Gly ser Ser Ser Gly Glu Met Ser His Asp Ala Glu Met Arg Ala 

195 200 205 

Ala Pro Gly Gly Asn Ala Val Glu Ala Gly Gin Gly Ala Asp Gly Val 

210 215 220 

Gly Asn Ala Ser Gly Asp Trp His cys Asp ser Thr Trp ser Glu Gly 
225 230 235 240 

Arg val Thr Thr Thr Ser Thr Arg Thr Trp val Leu Pro Thr Tyr Asn 

245 250 255 

Asn His Leu Tyr Leu Arg lie Gly Thr Thr Ala Asn Ser Asn Thr Tyr 

260 265 270 

Asn Gly Phe Ser Thr Pro Trp Gly Tyr Phe Asp Phe Asn Arg Phe His 

275 280 285 

Cys His Phe Ser Pro Arg Asp Trp Gin Arg Leu lie Asn Asn Asn Trp 

290 295 300 

Gly Leu Arg Pro Lys Ser Met Arg val Lys lie Phe Asn He Gin Val 
305 310 315 320 

Lys Glu val Thr Thr ser Asn Gly Glu Thr Thr val Ala Asn Asn Leu 

325 330 335 

Thr Ser Thr Val Gin lie Phe Ala Asp Ser Thr Tyr Glu Leu Pro Tyr 

340 345 350 

val Met Asp Ala Gly Gin Glu Gly Ser Phe Pro Pro Phe Pro Asn Asp 

355 360 365 

val Phe Met Val Pro Gin Tyr Gly Tyr Cys Gly Val Val Thr Gly Lys 

370 375 380 

Asn Gin Asn Gin Thr Asp Arg Asn Ala Phe Tyr Cys Leu Glu Tyr Phe 
385 390 395 400 

Pro Ser Gin Met Leu Arg Thr Gly Asn Asn Phe Glu val Ser Tyr Gin 

405 410 415 

Phe Glu Lys val Pro Phe His Ser Met Tyr Ala His ser Gin Ser Leu 

420 425 430 

Asp Arg Met Met Asn Pro Leu Leu Asp Gin Tyr Leu Trp His Leu Gin 

435 440 445 

Ser Thr Thr Thr Gly Asn Ser Leu Asn Gin Gly Thr Ala Thr Thr Thr 

450 455 460 

Tyr Gly Lys He Thr Thr Gly Asp Phe Ala Tyr Tyr Arg Lys Asn Trp 
465 470 475 480 

Leu Pro Gly Ala Cys lie Lys Gin Gin Lys Phe ser Lys Asn Ala Asn 

485 490 495 

Gin Asn Tyr Lys lie Pro Ala Ser Gly Gly Asp Ala Leu Leu Lys Tyr 

500 505 510 

Asp Thr His Thr Thr Leu Asn Gly Arg Trp Ser Asn Met Ala Pro Gly 

515 520 525 

Pro Pro Met Ala Thr Ala Gly Ala Gly Asp Ser Asp Phe Ser Asn ser 

530 535 540 

Gin Leu lie Phe Ala Gly Pro Asn Pro ser Gly Asn Thr Thr Thr ser 
545 550 555 560 

Ser Asn Asn Leu Leu Phe Thr Ser Glu Glu Glu He Ala Thr Thr Asn 

565 570 575 

Pro Arg Asp Thr Asp Met Phe Gly Gin lie Ala Asp Asn Asn Gin Asn 

580 585 590 

Ala Thr Thr Ala Pro His lie Ala Asn Leu Asp Ala Met Gly lie Val 

595 600 605 

Pro Gly Met val Trp Gin Asn Arg Asp lie Tyr Tyr Gin Gly Pro lie 

610 615 620 

Trp Ala Lys val Pro His Thr Asp Gly His Phe His Pro Ser Pro Leu 
625 630 635 640 

Met Gly Gly Phe Gly Leu Lys His Pro Pro Pro Gin He Phe lie Lys 

645 650 655 

Asn Thr Pro Val Pro Ala Asn Pro Asn Thr Thr Phe Ser Ala Ala Arg 

660 665 670 

lie Asn Ser Phe Leu Thr Gin Tyr Ser Thr Gly Gin Val Ala val Gin 

675 680 685 

lie Asp Trp Glu lie Gin Lys Glu His Ser Lys Arg Trp Asn Pro Glu 
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690 695 700 

val Gin Phe Thr ser Asn Tyr Gly Thr Gin Asn Ser Met Leu Trp Ala 
705 710 715 720 

Pro Asp Asn Ala Gly Asn Tyr His Glu Leu Arg Ala lie Gly Ser Arg 

725 730 735 

Phe Leu Thr His His Leu 
740 

<210> 31 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Note = 
synthethic construct 

<400> 31 

gcgacakttt gcgacaccay gtgg 24 

<210> 32 
<211> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Note = 
synthethic construct 

<221> mi sc_feature 
<222> 4-6 

<223> N can be a, g, c, or t (u) 
<400> 32 

ccannnggaa tcgcaatgcc aat 23 

<210> 33 
<211> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial sequence: Note = 
synthethic construct 

<221> misc_feature 
<222> 4, 6, 9 

<223> N can be a, g, c, or t (u) 
<400> 33 

atgntnatnt ggtgggagga ggg 23 

<210> 34 
<211> 25 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial sequence: Note = 
synthethic construct 

<221> misc_feature 
<222> 6 
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<223> N can be a, g, c, or t (u) 
<400> 34 

cgaatnaamc ggtttattga ttaac 25 

<210> 35 
<211> 6 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial sequence: Note = 
synthethic construct 

<400> 35 

Asn Gly Arg Ala His Ala 



<210> 36 
<211> 7 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Note = 
synthethic construct 

<400> 36 

Ser He Gly Tyr Pro Leu Pro 
1 5 

<210> 37 
<211> 10 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial sequence: Note = 
synthethic construct 

<400> 37 

Lys Phe Asn Lys Pro Phe Val Phe Leu He 
15 10 

<210> 38 
<211> 22 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Note = 
synthethic construct 

<400> 38 

Asn lie Ser Leu Asp Asn Pro Leu Glu Asn Pro ser ser Leu Phe Asp 
15 10 15 

Leu Val Ala Arg lie Lys 
20 

<210> 39 
<211> 26 
<212> DNA 

<213> Artificial sequence 
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<220> 

<223> Description of Artificial Sequence: Note = 
synthethic construct 

<221> misc_feature 
<222> 14 

<223> N can be a, g, c, or t (u) 
<400> 39 

caataaaccg kktnattcgt ktcagt 26 

<210> 40 
<211> 26 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Note = 
synthethic construct 

<221> misc_feature 
<222> 4-5, 21 

<223> N can be a, g, c f or t (u) 
<400> 40 

acannwgagt cagaaatkcc nggcag 26 

<210> 41 
<211> 640 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Description of Artificial Sequence: Note = 
synthethic construct 

<221> misc_feature 
<222> 608, 624 

<223> N can be a, g, c, or t (u) 
<400> 41 

gcgaattgaa tttagcggcc gcgaattcgc ccttcaataa accgtttgat tcgtgtcagt 60 

tgatctttgg cctacttgtc cttcttatct tatctggtcg ccatggctgc gtagataagc 120 

agcttggtat gcgcttcgcg gttaatcatc aactacgcca aaccctagat gatggagttg 180 

gccactccct ctatgcgcgc tcgctcgctc ggtggggccg gactgcccgg catttctgac 240 

tcctttgtga actgggtggc cctccgggct attgggtccc gtttcctcac ccaccacttg 300 

taacccttcc tggttaatca atgaaccgtt taattcgttt cagttgatct ttggcctact 360 

tgtccttctt atcttatctg gttgccatgg ctgcgtagat aagcagcttg gtatgcgctt 420 

cgcggttaat catcaactac gccaaaccct agatgatgga gttggccact ccctctatgc 480 

gcgctcgctc gctcggtggg gccggactgc cgggcatttc tgactcactt gtaagggcga 540 

aattcgttta aacctgcagg actagtccct ttagtgaggg ttaattctga gcttggcgta 600 

atcatggnca tagctgtttc ctgnggaaaa tgttatccgc 640 

<210> 42 
<211> 248 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial sequence: Note = 
synthethic construct 
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<400> 42 

tggaggggtg gagtcgtgrc gtgaattacg tcatagggtt agggaggtcc tgtattagag 60 

gtcacgtgag tgttttgcga cattttgcga caccatgtgg tcacgctggg tatttaagcc 120 

cgagtgwgca cgcagggtct ccattttgaa gcgggaggtt tgaacgcgca gccgccatgc 180 

cggggtttta cgagattgtg attaaggtcc ccagcgacct tgacgagcat ctgccaggaa 240 

tttctgac ^ 248 

<210> 43 
<211> 317 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Note = 
synthethic construct 

<400> 43 

gcccgggcgg cctcagtgag cgagcgagcg cgcagagagg gagtggccaa ctccatcact 60 

aggggttcct ggaggggtgg agtcgtgacg tgaattacgt catagggtta gggaggtcct 120 

gtattagagg tcacgtgagt gttttgcgac attttgcgac accatgtggt cacgctgggt 180 

atttaagccc gagtgagcac gcagggtctc cattttgaag cgggaggttt gaacgcgcag 240 

ccgccatgcc ggggttttac gagattgtga ttaaggtccc cagcgacctt gacgagcatc 300 

tgcctggaat ttctgac ~ 317 

<210> 44 
<211> 157 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial sequence: Note = 
synthethic construct 

<221> misc_feature 
<222> 33, 105, 141 
<223> N can be a, g, c, or t (u) 

<400> 44 

gagtgagcga gcagggtctc cattttgacc gcnaaatttg aacgagcagc atccatgccg 60 

ggcttctacg agatcgtgat caaggtgccg agcgacctgg acgancacct gcctggcatt 120 

tctgactcaa ttgtaagggc naattcgttt aaacctg 157 

<210> 45 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Note = 
synthethic construct 

<400> 45 

gcgacakttt gcgacaccay gtgg 24 

<210> 46 
<211> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Note = 
synthethic construct 
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<221> mis cofeature 
<222> 4-6 

<223> N can be a, g, c, or t (u) 
<400> 46 

ccannnggaa tcgcaatgcc aat 23 

<210> 47 

<211> 23 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial sequence: Note = 
synthethic construct 

<221> misc_feature 
<222> 4, 6, 9 

<223> N can be a, g, c, or t (u) 
<400> 47 

atgntnatnt ggtgggagga ggg 23 

<210> 48 
<2U> 25 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial sequence: Note = 
synthethic construct 

<221> misc_feature 
<222> 6 

<223> N can be a, g, c, or t (u) 
<400> 48 

cgaatnaamc ggtttattga ttaac 25 

<210> 49 
<211> 1872 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Description of Artificial Sequence: Note = 
synthethic construct 

<400> 49 

atgccgggct tctacgagat cgtgatcaag gtgccgagcg acctggacga gcacctgccg 60 

ggcatttctg actcgtttgt gaactgggtg gccgagaagg aatgggagct gcccccggat 120 

tctgacatgg atctgaatct gattgagcag gcacccctga ccgtggccga gaagctgcag 180 

cgcgacttcc tggtccaatg gcgccgcgtg agtaaggccc cggaggccct cttctttgtt 240 

cagttcgaga agggcgagtc ctacttccac ctccatattc tggtggagac cacgggggtc 300 

aaatccatgg tgctgggccg cttcttgagt cagattaggg acaagctggt gcagaccatc 360 

taccgcggga tcgagccgac cctgcccaac tggttcgcgg tgaccaagac gcgtaatggc 420 

gccggagggg ggaacaaggt ggtggacgag tgctacatcc ccaactacct cctgcccaag 480 

actcagcccg agctgcagtg ggcgtggact aacatggagg agtatataag cgcgtgtttg 540 

aacctggccg agcgcaaacg gctcgtggcg cagcacctga cccacgtcag ccagacccag 600 

gagcagaaca aggagaatct gaaccccaat tctgacgcgc ctgtcatccg gtcaaaaacc 660 

tccgcgcgct acatggagct ggtcgggtgg ctggtggacc ggggcatcac ctccgagaag 720 

cagtggatcc aggaggacca ggcctcgtac atctccttca acgccgcctc caactcgcgg 780 

tcccagatca aggccgctct ggacaatgcc ggcaagatca tggcgctgac caaatccgcg 840 
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cccgactacc tggtaggccc cgctcctccc gcggacatta aaaccaaccg catctaccgc 900 

atcctggagc tgaacggcta cgaccctgcc tacgccggct ccgtctttct cggctgggcc 960 

cagaaacggt tcgggaagcg caacaccatc tggctgtttg ggccggccac cacgggcaag 1020 

accaacatcg cggaagccat cgcccacgcc gtgcccttct acggctgcgt caactggacc 1080 

aatgagaact ttcccttcaa cgattgcgtc gacaagatgg tgatctggtg ggaggagggc 1140 

aagatgacgg ccaaggtcgt ggagtccgcc aaggccattc tcggcggcag caaggtgcgc 1200 

gtggaccaaa agtgcaagtc gtccgcccag atcgatccca cccccgtgat cgtcacctcc 1260 

aacaccaaca tgtgcgccgt gattgacggg aacagcacca ccttcgagca ccagcagccg 1320 

ttgcaggacc ggatgttcaa atttgaactc acccgccgtc tggagcatga ctttggcaag 1380 

gtgacaaagc aggaagtcaa agagttcttc cgctgggcgc aggatcacgt gaccgaggtg 1440 

gcgcatgagt tctacgtcag aaagggtgga gccaacaaaa gacccgcccc cgatgacgcg 1500 

gataaaagcg agcccaagcg ggcctgcccc tcagtcgcgg atccatcgac gtcagacgcg 1560 

gaaggagctc cggtggactt tgccgacagg taccaaaaca aatgttctcg tcacgcgggc 1620 

atgcttcaga tgctgtttcc ctgcaagaca tgcgagagaa tgaatcagaa tttcaacatt 1680 

tgcttcacgc acgggaccag agactgttca gaatgtttcc ccggcgtgtc agaatctcaa 1740 

ccggtcgtca gaaaaaggac gtatcggaaa ctctgtgcga ttcatcatct gctggggcgg 1800 

gctcccgaga ttgcttgctc ggcctgcgat ctggtcaacg tggacctgga tgactgtgtc 1860 

tctgagcaat aa 1872 

<210> 50 
<211> 623 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Note = 
synthethic construct 

<400> 50 

Met Pro Gly Phe Tyr Glu lie Val lie Lys Val Pro Ser Asp Leu Asp 

15 10 15 

Glu His Leu Pro Gly lie Ser Asp ser Phe val Asn Trp Val Ala Glu 

20 25 30 

Lys Glu Trp Glu Leu Pro Pro Asp ser Asp Met Asp Leu Asn Leu lie 

35 40 45 

Glu Gin Ala Pro Leu Thr Val Ala Glu Lys Leu Gin Arg Asp Phe Leu 

50 55 60 

Val Gin Trp Arg Arg Val Ser Lys Ala Pro Glu Ala Leu Phe Phe val 
65 70 75 80 

Gin Phe Glu Lys Gly Glu Ser Tyr Phe His Leu His lie Leu Val Glu 

85 90 95 

Thr Thr Gly Val Lys Ser Met Val Leu Gly Arg Phe Leu Ser Gin lie 

100 105 ~ 110 

Arg Asp Lys Leu Val Gin Thr lie Tyr Arg Gly lie Glu Pro Thr Leu 

115 120 ' 125 

Pro Asn Trp Phe Ala Val Thr Lys Thr Arg Asn Gly Ala Gly Gly Gly 

130 135 140 

Asn Lys val Val Asp Glu Cys Tyr lie Pro Asn Tyr Leu Leu Pro Lys 
145 150 155 160 

Thr Gin Pro Glu Leu Gin Trp Ala Trp Thr Asn Met Glu Glu Tyr lie 

165 170 175 

Ser Ala Cys Leu Asn Leu Ala Glu Arg Lys Arg Leu val Ala Gin His 

180 185 190 

Leu Thr His val ser Gin Thr Gin Glu Gin Asn Lys Glu Asn Leu Asn 

195 200 205 

Pro Asn Ser Asp Ala Pro Val lie Arg ser Lys Thr ser Ala Arg Tyr 

210 215 220 

Met Glu Leu val Gly Trp Leu Val Asp Arg Gly lie Thr ser Glu Lys 
225 230 235 240 

Gin Trp lie Gin Glu Asp Gin Ala ser Tyr lie Ser Phe Asn Ala Ala 

245 250 255 

ser Asn Ser Arg ser Gin lie Lys Ala Ala Leu Asp Asn Ala Gly Lys 
260 265 270 
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Ile wet Ala Leu Thr Lys Ser Ala Pro Asp Tyr Leu Val Gly Pro Ala 

275 280 285 

Pro Pro Ala Asp lie Lys Thr Asn Arg lie Tyr Arg He Leu Glu Leu 

290 295 300 

Asn Gly Tyr Asp Pro Ala Tyr Ala Gly ser val Phe Leu Gly Trp Ala 
305 310 315 320 

Gin Lys Arg Phe Gly Lys Arg Asn Thr lie Trp Leu Phe Gly Pro Ala 

325 ~ 330 335 

Thr Thr Gly Lys Thr Asn lie Ala Glu Ala lie Ala His Ala Val Pro 

340 345 350 

Phe Tyr Gly Cys Val Asn Trp Thr Asn Glu Asn Phe Pro Phe Asn Asp 

355 360 365 

cys val Asp Lys Met Val He Trp Trp Glu Glu Gly Lys Met Thr Ala 

370 375 380 

Lys val val Glu ser Ala Lys Ala He Leu Gly Gly Ser Lys Val Arg 
385 390 395 400 

val Asp Gin Lys cys Lys Ser ser Ala Gin lie Asp Pro Thr Pro Val 

405 410 415 

lie val Thr Ser Asn Thr Asn Met Cys Ala Val lie Asp Gly Asn ser 

420 425 430 

Thr Thr Phe Glu His Gin Gin Pro Leu Gin Asp Arg Met Phe Lys Phe 

435 440 445 

Glu Leu Thr Arg Arg Leu Glu His Asp Phe Gly Lys Val Thr Lys Gin 

450 455 460 

Glu Val Lys Glu Phe Phe Arg Trp Ala Gin Asp His val Thr Glu Val 
465 470 475 480 

Ala His Glu Phe Tyr Val Arg Lys Gly Gly Ala Asn Lys Arg Pro Ala 

485 490 495 

Pro Asp Asp Ala Asp Lys ser Glu Pro Lys Arg Ala Cys Pro Ser val 

500 505 510 

Ala Asp Pro Ser Thr ser Asp Ala Glu Gly Ala Pro val Asp Phe Ala 

515 520 525 

Asp Arg Tyr Gin Asn Lys Cys Ser Arg His Ala Gly Met Leu Gin Met 

530 535 ~ 540 

Leu Phe Pro cys Lys Thr Cys Glu Arg Met Asn Gin Asn Phe Asn lie 
545 550 555 560 

Cys Phe Thr His Gly Thr Arg Asp cys Ser Glu Cys Phe Pro Gly Val 

565 570 575 

Ser Glu Ser Gin pro val Val Arg Lys Arg Thr Tyr Arg Lys Leu cys 

580 585 590 

Ala lie His His Leu Leu Gly Arg Ala Pro Glu lie Ala Cys Ser Ala 

595 600 605 

Cys Asp Leu val Asn Val Asp Leu Asp Asp Cys val ser Glu Gin 
610 615 620 

<210> 51 
<211> 1872 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Note = 
synthethic construct 

<400> 51 

atgccgggct tctacgagat cgtgatcaag gtgccgagcg acctggacga gcacctgccg 60 

ggcatttctg actcgtttgt gaactgggtg gccgagaagg aatgggagct gcccccggat 120 

tctgacatgg atctgaatct gattgagcag gcacccctga ccgtggccga gaagctgcag 180 

cgcgacttcc tggtccaatg gcgccgcgtg agtaaggccc cggaggccct cttctttgtt 240 

cagttcgaga agggcgagtc ctacttccac ctccatattc tggtggagac cacgggggtc 300 

aaatccatgg tgctgggccg cttcctgagt cagattaggg acaagctggt gcagaccatc 360 

taccgcggga tcgagccgac cctgcccaac tggttcgcgg tgaccaagac gcgtaatggc 420 

gccggagggg ggaacaaggt ggtggacgag tgctacatcc ccaactacct cctgcccaag 480 
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actcagcccg agctgcagtg ggcgtggact aacatggagg agtatataag cgcgtgtttg 
aacctggccg agcgcaaacg gctcgtggcg cagcacctga cccacgtcag ccagacccag 
gagcagaaca aggagaatct gaacccgaat tctgacgcgc ctgtcatccg gtcaaaaacc 
tccgcgcgct acatggagct ggtcgggtgg ctggtggacc ggggcatcac ctccgagaag 
cagtggatcc aggaggacca ggcctcgtac atctccttca acgccgcctc caactcgcgg 
tcccagatca aggccgctct ggacaatgcc ggcaagatca tggcgctaac caaatccgcg 
cccgactacc tggtaggccc cgctccgccc gcggacatta aaaccaaccg catttaccgc 
atcctggagc tgaacggcta cgaccctgcc tacgccggct ccgtctttct cggctgggcc 
cagaaaaggt tcgggaagcg caacaccatc tggctgtttg ggccggccac cacgggcaag 
accaacatcg cggaagccat cgcacacgcc gtgcccttct acggctgcgt caactggacc 
aatgaaaact ttcccttcaa cgactgcgtc gacaagatgg tgatctggtg ggaggagggc 
aagatgacgg ccaaggtcgt ggagtccgcc aaggccattc tcggcggcag caaggtgcgc 
gtggaccaaa agtgcaagtc gtccgcccag atcgatccca cccccgtgat cgtcacctcc 
aacaccaaca tgtgcgccgt gattgacggg aacagcacca ccttcgagca ccagcagccg 
ttgcaggacc ggatgttcaa atttgaactc acccgccgtc tggagcacga ctttggcaag 
gtgacaaagc aggaagtcaa agagttcttc cgctgggcgc aggatcacgt gaccgaggtg 
gcgcatgagt tctacgtcag aaagggtgga gccaacaaga gacccgcccc cgatgacgcg 
gataaaagcg agcccaagcg ggtctgcccc tcagtcgcgg atccatcgac gtcagacgcg 
gaaggagctc cggtggactt tgccgacagg taccaaaaca aatgttctcg tcacgcgggc 
atgcttcaga tgctgtttcc ctgcaaaaca tgcgagagaa tgaatcagaa tttcaacatt 
tgcttcacgc acgggaccag agactgttca gaatgtttcc ccggcgtgtc agaatctcaa 
ccggtcgtca gaaaaaggac gtatcggaaa ctctgtgcca ttcatcatct gctggggcgg 
gctcccgaga ttgcttgctc ggcctgcgat ctggtcaacg tggacctgga tgactgtgtt 
tctgagcaat aa 

<210> 52 
<211> 623 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial sequence: Note = 
synthethic construct 

<400> 52 

Met Pro Gly Phe Tyr Glu lie Val lie Lys val Pro Ser Asp Leu Asp 

15 10 15 

Glu His Leu Pro Gly He Ser Asp ser Phe Val Asn Trp Val Ala Glu 

20 25 30 

Lys Glu Trp Glu Leu Pro Pro Asp ser Asp Met Asp Leu Asn Leu lie 

35 40 45 

Glu Gin Ala Pro Leu Thr Val Ala Glu Lys Leu Gin Arg Asp Phe Leu 

50 55 60 

val Gin Trp Arg Arg val ser Lys Ala Pro Glu Ala Leu Phe Phe val 
65 70 75 80 

Gin Phe Glu Lys Gly Glu Ser Tyr Phe His Leu His lie Leu Val Glu 

85 90 95 

Thr Thr Gly Val Lys Ser Met val Leu Gly Arg Phe Leu Ser Gin lie 

100 105 110 

Arg Asp Lys Leu Val Gin Thr lie Tyr Arg Gly lie Glu Pro Thr Leu 

115 120 ~ 125 

Pro Asn Trp Phe Ala val Thr Lys Thr Arg Asn Gly Ala Gly Gly Gly 

130 135 140 

Asn Lys Val Val Asp Glu Cys Tyr lie Pro Asn Tyr Leu Leu Pro Lys 
145 150 155 160 

Thr Gin Pro Glu Leu Gin Trp Ala Trp Thr Asn Met Glu Glu Tyr lie 

165 170 175 

Ser Ala Cys Leu Asn Leu Ala Glu Arg Lys Arg Leu Val Ala Gin His 

180 185 190 

Leu Thr His val Ser Gin Thr Gin Glu Gin Asn Lys Glu Asn Leu Asn 

195 200 205 

Pro Asn Ser Asp Ala Pro val lie Arg ser Lys Thr Ser Ala Arg Tyr 
210 215 220 
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Met Glu Leu val Gly Trp Leu Val Asp Arg Gly lie Thr Ser Glu Lys 
225 230 235 240 

Gin Trp He Gin Glu Asp Gin Ala Ser Tyr lie ser Phe Asn Ala Ala 

245 250 255 

ser Asn ser Arg ser Gin lie Lys Ala Ala Leu Asp Asn Ala Gly Lys 

260 265 270 

lie Met Ala Leu Thr Lys Ser Ala Pro Asp Tyr Leu Val Gly Pro Ala 

275 280 285 

Pro Pro Ala Asp lie Lys Thr Asn Arg lie Tyr Arg lie Leu Glu Leu 

290 295 300 

Asn Gly Tyr Asp pro Ala Tyr Ala Gly Ser val Phe Leu Gly Trp Ala 
305 310 315 320 

Gin Lys Arg phe Gly Lys Arg Asn Thr lie Trp Leu Phe Gly Pro Ala 

325 330 335 

Thr Thr Gly Lys Thr Asn lie Ala Glu Ala lie Ala His Ala Val Pro 

340 ■ 345 350 

Phe Tyr Gly Cys val Asn Trp Thr Asn Glu Asn Phe Pro Phe Asn Asp 

355 360 365 

Cys Val Asp Lys Met val lie Trp Trp Glu Glu Gly Lys Met Thr Ala 

370 375 380 

Lys val val Glu ser Ala Lys Ala lie Leu Gly Gly ser Lys Val Arg 
385 390 395 400 

val Asp Gin Lys Cys Lys Ser Ser Ala Gin lie Asp Pro Thr Pro val 

405 410 415 

lie val Thr ser Asn Thr Asn Met Cys Ala Val lie Asp Gly Asn ser 

420 425 430 

Thr Thr Phe Glu His Gin Gin Pro Leu Gin Asp Arg Met Phe Lys Phe 

435 440 445 

Glu Leu Thr Arg Arg Leu Glu His Asp Phe Gly Lys Val Thr Lys Gin 

450 455 460 

Glu val Lys Glu Phe Phe Arg Trp Ala Gin Asp His Val Thr Glu val 
465 470 475 480 

Ala His Glu Phe Tyr val Arg Lys Gly Gly Ala Asn Lys Arg Pro Ala 

485 ^ 490 495 

pro Asp Asp Ala Asp Lys Ser Glu pro Lys Arg val cys Pro Ser val 

500 505 510 

Ala Asp Pro ser Thr Ser Asp Ala Glu Gly Ala Pro Val Asp Phe Ala 

515 520 525 

Asp Arg Tyr Gin Asn Lys Cys Ser Arg His Ala Gly Met Leu Gin Met 

530 535 ^ 540 

Leu Phe Pro Cys Lys Thr Cys Glu Arg Met Asn Gin Asn Phe Asn lie 
545 550 ~ 555 560 

Cys Phe Thr His Gly Thr Arg Asp cys Ser Glu Cys Phe Pro Gly Val 

565 570 575 

Ser Glu ser Gin Pro val Val Arg Lys Arg Thr Tyr Arg Lys Leu Cys 

580 585 590 

Ala lie His His Leu Leu Gly Arg Ala Pro Glu lie Ala Cys ser Ala 

595 600 605 

cys Asp Leu val Asn val Asp Leu Asp Asp Cys val ser Glu Gin 
610 615 620 

<210> 53 
<211> 2689 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial sequence: Note = 
synthethic construct 

<400> 53 

ttgcgacagt ttgcgacacc atgtggtcac aagaggtata taaccgcgag tgagccagcg 60 
aggagctcca ttttgcccgc gaagtttgaa cgagcagcag ccatgccggg gttctacgag 120 
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gtggtgatca aggtgcccag cgacctggac gagcacctgc ccggcatttc tgactccttt 180 

gtgaactggg tggccgagaa ggaatgggag ttgcccccgg attctgacat ggatcagaat 240 

ctgattgagc aggcacccct gaccgtggcc gagaagctgc agcgcgagtt cctggtggaa 300 

tggcgccgag tgagtaaatt tctggaggcc aagttttttg tgcagtttga aaagggggac 360 

tcgtactttc atttgcatat tctgattgaa attaccggcg tgaaatccat ggtggtgggc 420 

cgctacgtga gtcagattag ggataaactg atccagcgca tctaccgcgg ggtcgagccc 480 

cagctgccca actggttcgc ggtcacaaag acccgaaatg gcgccggagg cgggaacaag 540 

gtggtggacg agtgctacat ccccaactac ctgctcccca aggtccagcc cgagcttcag 600 

tgggcgtgga ctaacatgga ggagtatata agcgcctgtt tgaacctcgc ggagcgtaaa 660 

cggctcgtgg cgcagcacct gacgcacgtc tcccagaccc aggagggcga caaggagaat 720 

ctgaacccga attctgacgc gccggtgatc cggtcaaaaa cctccgccag gtacatggag 780 

ctggtcgggt ggctggtgga caagggcatc acgtccgaga agcagtggat ccaggaggac 840 

caggcctcgt acatctcctt caacgcggcc tccaactccc ggtcgcagat caaggcggcc 900 

ctggacaatg cctccaaaat catgagcctc accaaaacgg ctccggacta tctcatcggg 960 

cagcagcccg tgggggacat taccaccaac cggatctaca aaatcctgga actgaacggg 1020 

tacgaccccc agtacgccgc ctccgtcttt ctcggctggg cccagaaaaa gtttggaaag 1080 

cgcaacacca tctggctgtt tgggcccgcc accaccggca agaccaacat cgcggaagcc 1140 

atcgcccacg cggtcccctt ctacggctgc gtcaactgga ccaatgagaa ctttcccttc 1200 

aacgactgcg tcgacaaaat ggtgatttgg tgggaggagg gcaagatgac cgccaaggtc 1260 

gtagagtccg ccaaggccat tctgggcggc agcaaggtgc gcgtggacca aaaatgcaag 1320 

gcctctgcgc agatcgaccc cacccccgtg atcgtcacct ccaacaccaa catgtgcgcc 1380 

gtgattgacg ggaacagcac caccttcgag caccagcagc ccctgcagga ccggatgttc 1440 

aagtttgaac tcacccgccg cctcgaccac gactttggca aggtcaccaa gcaggaagtc 1500 

aaggactttt tccggtgggc ggctgatcac gtgactgacg tggctcatga gttttacgtc 1560 

acaaagggtg gagctaagaa aaggcccgcc ccctctgacg aggatataag cgagcccaag 1620 

cggccgcgcg tgtcatttgc gcagccggag acgtcagacg cggaagctcc cggagacttc 1680 

gccgacaggt accaaaacaa atgttctcgt cacgcgggta tgctgcagat gctctttccc 1740 

tgcaagacgt gcgagagaat gaatcagaat tccaacgtct gcttcacgca cggtcagaaa 1800 

gattgcgggg agtgctttcc cgggtcagaa tctcaaccgg tttctgtcgt cagaaaaacg 1860 

tatcagaaac tgtgcatcct tcatcagctc cggggggcac ccgagatcgc ctgctctgct 1920 

tgcgaccaac tcaaccccga tttggacgat tgccaatttg agcaataaat gactgaaatc 1980 

aggtatggct gctgacggtt atcttccaga ttggctcgag gacaacctct ctgaaggcat 2040 

tcgcgagtgg tgggcgctga aacctggagc tccacaaccc aaggccaacc aacagcatca 2100 

ggacaacggc aggggtcttg tgcttcctgg gtacaagtac ctcggaccct tcaacggact 2160 

cgacaaggga gagccggtca acgaggcaga cgccgcggcc ctcgagcacg acaaggccta 2220 

cgacaagcag ctcgagcagg gggacaaccc gtatctcaag tacaaccacg ccgacgccga 2280 

gttccagcag cgcttggcga ccgacacctc ttttgggggc aacctcgggc gagcagtctt 2340 

ccaggccaaa aagaggattc tcgagcctct gggtctggtt gaagagggcg ttaaaacggc 2400 

tcctggaaag aaacgcccat tagaaaagac tccaaatcgg ccgaccaacc cggactctgg 2460 

gaaggccccg gccaagaaaa agcaaaaaga cggcgaacca gccgactctg ctagaaggac 2520 

actcgacttt gaagactctg gagcaggaga cggaccccct gagggatcat cttccggaga 2580 

aatgtctcat gatgctgaga tgcgtgcggc gccaggcgga aatgctgtcg aggcgggaca 2640 

aggtgccgat ggagtgggta atgcctccgg tgattggcat tgcgattcc 2689 

<210> 54 
<211> 621 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial sequence: Note = 
synthethic construct 

<400> 54 

Met Pro Gly Phe Tyr Glu Val val lie Lys Val Pro Ser Asp Leu Asp 

15 10 15 

Glu His Leu Pro Gly lie Ser Asp ser Phe val Asn Trp val Ala Glu 

20 25 30 

Lys Glu Trp Glu Leu Pro Pro Asp ser Asp Met Asp Gin Asn Leu lie 

35 40 45 

Glu Gin Ala Pro Leu Thr Val Ala Glu Lys Leu Gin Arg Glu Phe Leu 

50 55 60 

val Glu Trp Arg Arg Val Ser Lys Phe Leu Glu Ala Lys Phe Phe val 
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65 70 75 n 80 

Gin Phe 61 u Lys Gly Asp Ser Tyr Phe His Leu His lie Leu lie Glu 

85 90 95 

lie Thr Gly val Lys Ser Met Val val Gly Arg Tyr val Ser Gin He 

100 105 110 

Arg Asp Lys Leu lie Gin Arg lie Tyr Arg Gly Val Glu Pro Gin Leu 

115 120 125 

Pro Asn Trp Phe Ala val Thr Lys Thr Arg Asn Gly Ala Gly Gly Gly 

130 135 140 

Asn Lys val Val Asp Glu Cys Tyr lie Pro Asn Tyr Leu Leu Pro Lys 
145 150 155 ^ 160 

val Gin Pro Glu Leu Gin Trp Ala Trp Thr Asn Met Glu Glu Tyr lie 

165 170 175 

Ser Ala Cys Leu Asn Leu Ala Glu Arg Lys Arg Leu val Ala Gin His 

180 185 190 

Leu Thr His val Ser Gin Thr Gin Glu Gly Asp Lys Glu Asn Leu Asn 

195 200 205 

Pro Asn Ser Asp Ala Pro val lie Arg ser Lys Thr Ser Ala Arg Tyr 

210 215 ~ 220 

Met Glu Leu Val Gly Trp Leu Val Asp Lys Gly lie Thr Ser Glu Lys 
225 230 235 n 240 

Gin Trp lie Gin Glu Asp Gin Ala Ser Tyr lie Ser Phe Asn Ala Ala 

245 250 255 

Ser Asn ser Arg Ser Gin lie Lys Ala Ala Leu Asp Asn Ala ser Lys 

260 265 270 

lie Met Ser Leu Thr Lys Thr Ala Pro Asp Tyr Leu lie Gly Gin Gin 

275 280 285 

Pro val Gly Asp lie Thr Thr Asn Arg He Tyr Lys lie Leu Glu Leu 

290 295 300 

Asn Gly Tyr Asp Pro Gin Tyr Ala Ala Ser val Phe Leu Gly Trp Ala 
305 310 315 320 

Gin Lys Lys Phe Gly Lys Arg Asn Thr He Trp Leu Phe Gly pro Ala 

325 330 335 

Thr Thr Gly Lys Thr Asn He Ala Glu Ala He Ala His Ala val Pro 

340 345 350 

Phe Tyr Gly cys val Asn Trp Thr Asn Glu Asn Phe Pro Phe Asn Asp 

355 360 365 

Cys val Asp Lys Met val lie Trp Trp Glu Glu Gly Lys Met Thr Ala 

370 375 380 

Lys val Val Glu Ser Ala Lys Ala He Leu Gly Gly ser Lys Val Arg 
385 390 395 400 

val Asp Gin Lys Cys Lys Ala ser Ala Gin He Asp Pro Thr Pro val 

H 405 410 415 

He val Thr ser Asn Thr Asn Met Cys Ala val lie Asp Gly Asn ser 

420 425 .430 

Thr Thr Phe Glu His Gin Gin Pro Leu Gin Asp Arg Met Phe Lys Phe 

435 440 445 

Glu Leu Thr Arg Arg Leu Asp His Asp Phe Gly Lys Val Thr Lys Gin 

450 455 460 - 

Glu val Lys Asp Phe Phe Arg Trp Ala Ala Asp His val Thr Asp val 
465 470 475 480 

Ala His Glu Phe Tyr val Thr Lys Gly Gly Ala Lys Lys Arg Pro Ala 

485 490 495 

pro Ser Asp Glu Asp He Ser Glu Pro Lys Arg pro Arg val ser Phe 

500 505 n 510 

Ala Gin Pro Glu Thr Ser Asp Ala Glu Ala Pro Gly Asp Phe Ala Asp 

515 520 ^ 525 

Arg Tyr Gin Asn Lys Cys ser Arg His Ala Gly Met Leu Gin Met Leu 

530 535 , 540 

Phe Pro Cys Lys Thr Cys Glu Arg Met Asn Gin Asn Ser Asn val cys 
545 550 555 ^ 560 

Phe Thr His Gly Gin Lys Asp Cys Gly Glu Cys Phe Pro Gly Ser Glu 
565 570 575 
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Ser Gin Pro val ser val val Arg Lys Thr Tyr Gin Lys Leu cys lie 

580 585 590 

Leu His Gin Leu Arg Gly Ala Pro Glu lie Ala cys Ser Ala Cys Asp 

595 600 605 

Gin Leu Asn Pro Asp Leu Asp Asp Cys Gin Phe Glu Gin 
610 615 620 
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